jakmro commited on
Commit
b3311a6
Β·
verified Β·
1 Parent(s): 2aab737

Update organization README

Browse files
Files changed (1) hide show
  1. README.md +45 -33
README.md CHANGED
@@ -131,7 +131,7 @@ graph.hard_reset();
131
  - All weights INT4 quantised
132
  - LFM: 1k-prefill / 100-decode, values are prefill tps / decode tps
133
  - LFM-VL: 256px input, values are latency / decode tps
134
- - Parakeet: 30s audio input, values are latency / decode tps
135
  - Missing latency = no NPU support yet
136
 
137
  | Device | LFM 1.2B | LFMVL 1.6B | Parakeet 1.1B | RAM |
@@ -146,6 +146,50 @@ graph.hard_reset();
146
  | CMF Phone 2 Pro | - | - | - | - |
147
  | Raspberry Pi 5 | 69/11 | 13.3s/11 | 4.5s/180k+ | 869MB |
148
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
149
  ## Roadmap
150
 
151
  | Date | Status | Milestone |
@@ -228,38 +272,6 @@ graph.hard_reset();
228
  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
229
  ```
230
 
231
- ## Supported Models
232
-
233
- | Model | Features |
234
- |-------|----------|
235
- | google/gemma-3-270m-it | completion |
236
- | google/functiongemma-270m-it | completion, tools |
237
- | LiquidAI/LFM2-350M | completion, tools, embed |
238
- | Qwen/Qwen3-0.6B | completion, tools, embed |
239
- | LiquidAI/LFM2-700M | completion, tools, embed |
240
- | LiquidAI/LFM2-8B-A1B | completion, tools, embed |
241
- | google/gemma-3-1b-it | completion |
242
- | LiquidAI/LFM2-1.2B | completion, tools, embed |
243
- | LiquidAI/LFM2-1.2B-Tool | completion, tools, embed |
244
- | LiquidAI/LFM2-1.2B-RAG | completion, tools, embed |
245
- | LiquidAI/LFM2.5-1.2B-Thinking | completion, tools, embed |
246
- | LiquidAI/LFM2.5-1.2B-Instruct | completion, tools, embed |
247
- | Qwen/Qwen3-1.7B | completion, tools, embed |
248
- | LiquidAI/LFM2-2.6B | completion, tools, embed |
249
- | LiquidAI/LFM2-VL-450M | vision, txt & img embed, Apple NPU |
250
- | LiquidAI/LFM2.5-VL-1.6B | vision, txt & img embed, Apple NPU |
251
- | UsefulSensors/moonshine-base | transcription, speech embed |
252
- | openai/whisper-tiny | transcription, speech embed, Apple NPU |
253
- | openai/whisper-base | transcription, speech embed, Apple NPU |
254
- | openai/whisper-small | transcription, speech embed, Apple NPU |
255
- | openai/whisper-medium | transcription, speech embed, Apple NPU |
256
- | nvidia/parakeet-ctc-0.6b | transcribe, speech embed, Apple NPU |
257
- | nvidia/parakeet-ctc-1.1b | transcribe, speech embed, Apple NPU |
258
- | nvidia/parakeet-tdt-0.6b-v3 | transcribe, speech embed, Apple NPU |
259
- | snakers4/silero-vad | vad |
260
- | nomic-ai/nomic-embed-text-v2-moe | embed |
261
- | Qwen/Qwen3-Embedding-0.6B | embed |
262
-
263
  ## Maintaining Organisations
264
 
265
  1. [Cactus Compute, Inc. (YC S25)](https://cactuscompute.com/)
 
131
  - All weights INT4 quantised
132
  - LFM: 1k-prefill / 100-decode, values are prefill tps / decode tps
133
  - LFM-VL: 256px input, values are latency / decode tps
134
+ - Parakeet: 20s audio input, values are latency / decode tps
135
  - Missing latency = no NPU support yet
136
 
137
  | Device | LFM 1.2B | LFMVL 1.6B | Parakeet 1.1B | RAM |
 
146
  | CMF Phone 2 Pro | - | - | - | - |
147
  | Raspberry Pi 5 | 69/11 | 13.3s/11 | 4.5s/180k+ | 869MB |
148
 
149
+ ## Supported Transcription Model
150
+
151
+ - STT: 20s audio input on Macbook Air M3 chip
152
+ - Benchmark dataset: internal evals with production users
153
+
154
+ | Model | Params | End2End ms | Latency ms | Decode toks/sec | NPU | RTF | WER |
155
+ |-------|--------|------------|------------|------------|-----|-----|-----|
156
+ | UsefulSensors/moonshine-base | 61M | 361.35 | 182 | 262 | yes | 0.0180 | 0.1395 |
157
+ | openai/whisper-tiny | 39M | 232.03 | 137.38 | 581 | yes | 0.0116 | 0.1860 |
158
+ | openai/whisper-base | 74M | 329.37 | 178.65 | 358 | yes | 0.0164 | 0.1628 |
159
+ | openai/whisper-small | 244M | 856.79 | 332.63 | 108 | yes | 0.0428 | 0.0930 |
160
+ | openai/whisper-medium | 769M | 2085.87 | 923.33 | 49 | yes | 0.1041 | 0.0930 |
161
+ | nvidia/parakeet-ctc-0.6b | 600M | 201.77 | 201.44 | 5214285 | yes | 0.0101 | 0.0930 |
162
+ | nvidia/parakeet-tdt-0.6b-v3 | 600M | 718.91 | 718.82 | 3583333 | no | 0.0359 | 0.0465 |
163
+ | nvidia/parakeet-ctc-1.1b | 1.1B | 279.03 | 278.92 | 4562500 | yes | 0.0139 | 0.1628 |
164
+ | snakers4/silero-vad | - | - | - | - | - | - | - |
165
+
166
+ ## Supported LLMs
167
+
168
+ - Gemma weights are often **gated** on HuggingFace, needs tokens
169
+ - Run `hf auth login` and input your huggingface token
170
+
171
+ | Model | Features |
172
+ |-------|----------|
173
+ | google/gemma-3-270m-it | completion |
174
+ | google/functiongemma-270m-it | tools |
175
+ | google/gemma-3-1b-it | completion, gated |
176
+ | google/gemma-3n-E2B-it | completion, tools |
177
+ | google/gemma-3n-E4B-it | completion, tools |
178
+ | Qwen/Qwen3-0.6B | completion, tools, embed |
179
+ | Qwen/Qwen3-Embedding-0.6B | embed |
180
+ | Qwen/Qwen3.5-0.8B | vision, completion, tools, embed |
181
+ | Qwen/Qwen3-1.7B | completion, tools, embed |
182
+ | Qwen/Qwen3.5-2B | vision, completion, tools, embed |
183
+ | LiquidAI/LFM2-350M | completion, tools, embed |
184
+ | LiquidAI/LFM2-700M | completion, tools, embed |
185
+ | LiquidAI/LFM2-8B-A1B | completion, tools, embed |
186
+ | LiquidAI/LFM2.5-1.2B-Thinking | completion, tools, embed |
187
+ | LiquidAI/LFM2.5-1.2B-Instruct | completion, tools, embed |
188
+ | LiquidAI/LFM2-2.6B | completion, tools, embed |
189
+ | LiquidAI/LFM2-VL-450M | vision, txt & img embed, Apple NPU |
190
+ | LiquidAI/LFM2.5-VL-1.6B | vision, txt & img embed, Apple NPU |
191
+ | nomic-ai/nomic-embed-text-v2-moe | embed |
192
+
193
  ## Roadmap
194
 
195
  | Date | Status | Milestone |
 
272
  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
273
  ```
274
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
275
  ## Maintaining Organisations
276
 
277
  1. [Cactus Compute, Inc. (YC S25)](https://cactuscompute.com/)