Spaces:
Configuration error
Configuration error
Update organization README
Browse files
README.md
CHANGED
|
@@ -131,7 +131,7 @@ graph.hard_reset();
|
|
| 131 |
- All weights INT4 quantised
|
| 132 |
- LFM: 1k-prefill / 100-decode, values are prefill tps / decode tps
|
| 133 |
- LFM-VL: 256px input, values are latency / decode tps
|
| 134 |
-
- Parakeet:
|
| 135 |
- Missing latency = no NPU support yet
|
| 136 |
|
| 137 |
| Device | LFM 1.2B | LFMVL 1.6B | Parakeet 1.1B | RAM |
|
|
@@ -146,6 +146,50 @@ graph.hard_reset();
|
|
| 146 |
| CMF Phone 2 Pro | - | - | - | - |
|
| 147 |
| Raspberry Pi 5 | 69/11 | 13.3s/11 | 4.5s/180k+ | 869MB |
|
| 148 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 149 |
## Roadmap
|
| 150 |
|
| 151 |
| Date | Status | Milestone |
|
|
@@ -228,38 +272,6 @@ graph.hard_reset();
|
|
| 228 |
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
|
| 229 |
```
|
| 230 |
|
| 231 |
-
## Supported Models
|
| 232 |
-
|
| 233 |
-
| Model | Features |
|
| 234 |
-
|-------|----------|
|
| 235 |
-
| google/gemma-3-270m-it | completion |
|
| 236 |
-
| google/functiongemma-270m-it | completion, tools |
|
| 237 |
-
| LiquidAI/LFM2-350M | completion, tools, embed |
|
| 238 |
-
| Qwen/Qwen3-0.6B | completion, tools, embed |
|
| 239 |
-
| LiquidAI/LFM2-700M | completion, tools, embed |
|
| 240 |
-
| LiquidAI/LFM2-8B-A1B | completion, tools, embed |
|
| 241 |
-
| google/gemma-3-1b-it | completion |
|
| 242 |
-
| LiquidAI/LFM2-1.2B | completion, tools, embed |
|
| 243 |
-
| LiquidAI/LFM2-1.2B-Tool | completion, tools, embed |
|
| 244 |
-
| LiquidAI/LFM2-1.2B-RAG | completion, tools, embed |
|
| 245 |
-
| LiquidAI/LFM2.5-1.2B-Thinking | completion, tools, embed |
|
| 246 |
-
| LiquidAI/LFM2.5-1.2B-Instruct | completion, tools, embed |
|
| 247 |
-
| Qwen/Qwen3-1.7B | completion, tools, embed |
|
| 248 |
-
| LiquidAI/LFM2-2.6B | completion, tools, embed |
|
| 249 |
-
| LiquidAI/LFM2-VL-450M | vision, txt & img embed, Apple NPU |
|
| 250 |
-
| LiquidAI/LFM2.5-VL-1.6B | vision, txt & img embed, Apple NPU |
|
| 251 |
-
| UsefulSensors/moonshine-base | transcription, speech embed |
|
| 252 |
-
| openai/whisper-tiny | transcription, speech embed, Apple NPU |
|
| 253 |
-
| openai/whisper-base | transcription, speech embed, Apple NPU |
|
| 254 |
-
| openai/whisper-small | transcription, speech embed, Apple NPU |
|
| 255 |
-
| openai/whisper-medium | transcription, speech embed, Apple NPU |
|
| 256 |
-
| nvidia/parakeet-ctc-0.6b | transcribe, speech embed, Apple NPU |
|
| 257 |
-
| nvidia/parakeet-ctc-1.1b | transcribe, speech embed, Apple NPU |
|
| 258 |
-
| nvidia/parakeet-tdt-0.6b-v3 | transcribe, speech embed, Apple NPU |
|
| 259 |
-
| snakers4/silero-vad | vad |
|
| 260 |
-
| nomic-ai/nomic-embed-text-v2-moe | embed |
|
| 261 |
-
| Qwen/Qwen3-Embedding-0.6B | embed |
|
| 262 |
-
|
| 263 |
## Maintaining Organisations
|
| 264 |
|
| 265 |
1. [Cactus Compute, Inc. (YC S25)](https://cactuscompute.com/)
|
|
|
|
| 131 |
- All weights INT4 quantised
|
| 132 |
- LFM: 1k-prefill / 100-decode, values are prefill tps / decode tps
|
| 133 |
- LFM-VL: 256px input, values are latency / decode tps
|
| 134 |
+
- Parakeet: 20s audio input, values are latency / decode tps
|
| 135 |
- Missing latency = no NPU support yet
|
| 136 |
|
| 137 |
| Device | LFM 1.2B | LFMVL 1.6B | Parakeet 1.1B | RAM |
|
|
|
|
| 146 |
| CMF Phone 2 Pro | - | - | - | - |
|
| 147 |
| Raspberry Pi 5 | 69/11 | 13.3s/11 | 4.5s/180k+ | 869MB |
|
| 148 |
|
| 149 |
+
## Supported Transcription Model
|
| 150 |
+
|
| 151 |
+
- STT: 20s audio input on Macbook Air M3 chip
|
| 152 |
+
- Benchmark dataset: internal evals with production users
|
| 153 |
+
|
| 154 |
+
| Model | Params | End2End ms | Latency ms | Decode toks/sec | NPU | RTF | WER |
|
| 155 |
+
|-------|--------|------------|------------|------------|-----|-----|-----|
|
| 156 |
+
| UsefulSensors/moonshine-base | 61M | 361.35 | 182 | 262 | yes | 0.0180 | 0.1395 |
|
| 157 |
+
| openai/whisper-tiny | 39M | 232.03 | 137.38 | 581 | yes | 0.0116 | 0.1860 |
|
| 158 |
+
| openai/whisper-base | 74M | 329.37 | 178.65 | 358 | yes | 0.0164 | 0.1628 |
|
| 159 |
+
| openai/whisper-small | 244M | 856.79 | 332.63 | 108 | yes | 0.0428 | 0.0930 |
|
| 160 |
+
| openai/whisper-medium | 769M | 2085.87 | 923.33 | 49 | yes | 0.1041 | 0.0930 |
|
| 161 |
+
| nvidia/parakeet-ctc-0.6b | 600M | 201.77 | 201.44 | 5214285 | yes | 0.0101 | 0.0930 |
|
| 162 |
+
| nvidia/parakeet-tdt-0.6b-v3 | 600M | 718.91 | 718.82 | 3583333 | no | 0.0359 | 0.0465 |
|
| 163 |
+
| nvidia/parakeet-ctc-1.1b | 1.1B | 279.03 | 278.92 | 4562500 | yes | 0.0139 | 0.1628 |
|
| 164 |
+
| snakers4/silero-vad | - | - | - | - | - | - | - |
|
| 165 |
+
|
| 166 |
+
## Supported LLMs
|
| 167 |
+
|
| 168 |
+
- Gemma weights are often **gated** on HuggingFace, needs tokens
|
| 169 |
+
- Run `hf auth login` and input your huggingface token
|
| 170 |
+
|
| 171 |
+
| Model | Features |
|
| 172 |
+
|-------|----------|
|
| 173 |
+
| google/gemma-3-270m-it | completion |
|
| 174 |
+
| google/functiongemma-270m-it | tools |
|
| 175 |
+
| google/gemma-3-1b-it | completion, gated |
|
| 176 |
+
| google/gemma-3n-E2B-it | completion, tools |
|
| 177 |
+
| google/gemma-3n-E4B-it | completion, tools |
|
| 178 |
+
| Qwen/Qwen3-0.6B | completion, tools, embed |
|
| 179 |
+
| Qwen/Qwen3-Embedding-0.6B | embed |
|
| 180 |
+
| Qwen/Qwen3.5-0.8B | vision, completion, tools, embed |
|
| 181 |
+
| Qwen/Qwen3-1.7B | completion, tools, embed |
|
| 182 |
+
| Qwen/Qwen3.5-2B | vision, completion, tools, embed |
|
| 183 |
+
| LiquidAI/LFM2-350M | completion, tools, embed |
|
| 184 |
+
| LiquidAI/LFM2-700M | completion, tools, embed |
|
| 185 |
+
| LiquidAI/LFM2-8B-A1B | completion, tools, embed |
|
| 186 |
+
| LiquidAI/LFM2.5-1.2B-Thinking | completion, tools, embed |
|
| 187 |
+
| LiquidAI/LFM2.5-1.2B-Instruct | completion, tools, embed |
|
| 188 |
+
| LiquidAI/LFM2-2.6B | completion, tools, embed |
|
| 189 |
+
| LiquidAI/LFM2-VL-450M | vision, txt & img embed, Apple NPU |
|
| 190 |
+
| LiquidAI/LFM2.5-VL-1.6B | vision, txt & img embed, Apple NPU |
|
| 191 |
+
| nomic-ai/nomic-embed-text-v2-moe | embed |
|
| 192 |
+
|
| 193 |
## Roadmap
|
| 194 |
|
| 195 |
| Date | Status | Milestone |
|
|
|
|
| 272 |
ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
|
| 273 |
```
|
| 274 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 275 |
## Maintaining Organisations
|
| 276 |
|
| 277 |
1. [Cactus Compute, Inc. (YC S25)](https://cactuscompute.com/)
|