Fun-ASR-Nano · GGUF (FunASR llama.cpp runtime)
GGUF build of Fun-ASR-Nano (SenseVoice SAN-M encoder + adaptor + Qwen3-0.6B LLM decoder) for the FunASR llama.cpp runtime, a zero-Python, single-binary C++ runtime for CPU and edge devices. The LLM decoder makes this build the accuracy leader.
Files
| file | size | notes |
|---|---|---|
| funasr-encoder-f16.gguf | 470 MB | audio encoder + adaptor (f16) |
| qwen3-0.6b-q8_0.gguf | 805 MB | LLM decoder, recommended (Q8_0) |
| qwen3-0.6b-q4km.gguf | 484 MB | LLM decoder, smaller (Q4_K_M) |
Usage (needs both the encoder and the LLM gguf)
llama-funasr-cli --enc funasr-encoder-f16.gguf -m qwen3-0.6b-q8_0.gguf -a audio.wav --vad fsmn-vad.gguf
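Because the command needs several model files at once, a small wrapper can check for them before launching. The following is a minimal sketch, not part of the runtime: the function names `require_files` and `transcribe` and the `FUNASR_CLI` override variable are hypothetical, and only the flags shown in the command above are used. It also assumes `fsmn-vad.gguf`, which is not listed in the Files table, is already present in the working directory.

```shell
#!/bin/sh
# Hypothetical helpers; only the CLI flags come from the usage line above.

require_files() {
    # Report the first missing file on stderr and return non-zero.
    for f in "$@"; do
        if [ ! -f "$f" ]; then
            echo "missing: $f" >&2
            return 1
        fi
    done
}

transcribe() {
    # $1 = audio file, $2 = LLM decoder gguf (defaults to the Q8_0 build)
    audio=$1
    decoder=${2:-qwen3-0.6b-q8_0.gguf}
    require_files funasr-encoder-f16.gguf "$decoder" fsmn-vad.gguf "$audio" || return 1
    # FUNASR_CLI is an assumed override for the binary path.
    "${FUNASR_CLI:-llama-funasr-cli}" --enc funasr-encoder-f16.gguf \
        -m "$decoder" -a "$audio" --vad fsmn-vad.gguf
}
```

After sourcing the file, `transcribe audio.wav qwen3-0.6b-q4km.gguf` would run the smaller Q4_K_M decoder instead of the default Q8_0 one.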
On CPU, this reaches 8.30 % CER (character error rate) on the 184-clip Mandarin benchmark, versus 22–31 % for whisper.cpp.
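For readers unfamiliar with the metric, CER is conventionally defined as the character-level edit distance between hypothesis and reference, divided by the reference length. The formula below is that standard definition; the exact text normalization used for this benchmark is not stated here, so treat it as a sketch of how such a number is typically computed.

```latex
\mathrm{CER} = \frac{S + D + I}{N}
% S = substituted characters, D = deleted characters,
% I = inserted characters, N = characters in the reference transcript
```

Under this definition, 8.30 % CER corresponds to roughly 8 character errors per 100 reference characters.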
Links
- 🧩 Runtime & build: Fun-ASR · runtime/llama.cpp — ⭐ Star Fun-ASR!
- Source model: FunAudioLLM/Fun-ASR-Nano-2512