Qwen3-ASR 1.7B · OpenASR
State-of-the-art multilingual speech recognition across 52 languages & dialects
Native speech-to-text in the OpenASR runtime, engineered for peak performance on CPU & GPU, with no Python at inference time.
Highlights
- SOTA among open-source ASR: competitive with the strongest proprietary commercial APIs
- 52 languages & dialects: 30 languages plus 22 Chinese dialects, with spoken-language identification
- Robust on hard audio: clean speech, singing voice, and songs over background music
- Native in OpenASR: .oasr packs run with no Python at inference, engineered for peak performance on CPU & GPU
Quickstart
# 1. Install the OpenASR CLI · https://openasr.org
# 2. Pull a build (pick a quant; see the table below)
openasr pull qwen3-asr-1.7b:q8
# 3. Transcribe
openasr transcribe audio.wav --model qwen3-asr-1.7b
All builds for this model:
openasr pull qwen3-asr-1.7b:fp16
openasr pull qwen3-asr-1.7b:q8
openasr pull qwen3-asr-1.7b:q4
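The transcribe step in the Quickstart handles one file at a time. A minimal sketch of a batch wrapper follows, assuming the CLI prints the transcript to standard output (this README does not confirm that). `transcribe_dir` and the `OPENASR_BIN` override are hypothetical names introduced here for illustration; only the `openasr transcribe <file> --model qwen3-asr-1.7b` invocation comes from the Quickstart.

```shell
#!/bin/sh
# transcribe_dir DIR: run the Quickstart transcribe command on every .wav in DIR.
# Assumptions (not confirmed by this README): the CLI prints the transcript to
# stdout, and OPENASR_BIN is a hypothetical override that allows dry runs.
transcribe_dir() {
  dir=$1
  for wav in "$dir"/*.wav; do
    [ -e "$wav" ] || continue            # skip when the glob matched nothing
    out="${wav%.wav}.txt"                # write the transcript next to the audio
    if "${OPENASR_BIN:-openasr}" transcribe "$wav" --model qwen3-asr-1.7b > "$out"; then
      echo "wrote $out"
    else
      echo "failed: $wav" >&2
    fi
  done
}
```

Calling `transcribe_dir ./recordings` would then leave one `.txt` file beside each `.wav`. Setting `OPENASR_BIN=echo` first turns the call into a dry run that records the command line instead of a transcript.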
Available builds
| Quant | File (.oasr) | Size | RAM peak | RTF · M1 CPU | RTF · M1 GPU | JFK ΔWER vs fp16 |
|---|---|---|---|---|---|---|
| fp16 | qwen3-asr-1.7b-fp16.oasr | 4.70 GB | 6.47 GB | 2.95× | 1.61× | 0.0% |
| q8_0 | qwen3-asr-1.7b-q8_0.oasr | 2.51 GB | 5.05 GB | 2.84× | 0.88× | 0.0% |
| q4_k | qwen3-asr-1.7b-q4_k.oasr | 1.33 GB | 4.80 GB | 3.47× | 0.56× | 0.0% |
RTF = real-time factor on the fixed 11 s JFK clip (lower is faster); RAM peak is measured per pack in an isolated subprocess. JFK ΔWER compares each quantized build's JFK transcript to this model's fp16 JFK transcript, so it measures quantization drift rather than absolute recognition accuracy. q8_0 is the recommended default: near-reference quality at a fraction of the footprint.
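To make the table's figures more concrete, the sketch below converts each RTF into an estimated processing time for the 11 s clip and expresses each pack size as a percentage of the fp16 pack. It assumes the conventional definition of RTF (processing time divided by audio duration), which this README does not spell out. The helper names `estimate_seconds` and `size_percent` are illustrative, not part of OpenASR.

```shell
#!/bin/sh
# Figures are copied from the "Available builds" table.
# Assumption: RTF = processing time / audio duration (the conventional definition).
CLIP_SECONDS=11      # length of the JFK clip
FP16_SIZE_GB=4.70    # reference size for the footprint ratio

# estimate_seconds RTF DURATION -> estimated processing time in seconds
estimate_seconds() {
  awk -v rtf="$1" -v dur="$2" 'BEGIN { printf "%.2f\n", rtf * dur }'
}

# size_percent SIZE_GB -> pack size as a percentage of the fp16 pack
size_percent() {
  awk -v size="$1" -v ref="$FP16_SIZE_GB" 'BEGIN { printf "%.0f\n", 100 * size / ref }'
}

# Columns: quant, size (GB), RTF on M1 CPU, RTF on M1 GPU
while read -r quant size cpu gpu; do
  printf '%s: CPU ~%ss, GPU ~%ss, %s%% of fp16 size\n' \
    "$quant" \
    "$(estimate_seconds "$cpu" "$CLIP_SECONDS")" \
    "$(estimate_seconds "$gpu" "$CLIP_SECONDS")" \
    "$(size_percent "$size")"
done <<EOF
fp16 4.70 2.95 1.61
q8_0 2.51 2.84 0.88
q4_k 1.33 3.47 0.56
EOF
```

Under that assumption, q8_0 comes out at about 53% of the fp16 pack size, and only the q8_0 and q4_k GPU runs finish faster than real time on this clip.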
About Qwen3-ASR 1.7B
Qwen3-ASR-1.7B is a multilingual automatic speech recognition model (~2B parameters,
~1.7B active, BF16) from Alibaba's Qwen3-ASR family that transcribes speech while
identifying the spoken language across 30 languages, 22 Chinese dialects, and a range
of regional English accents, and it holds up on hard audio including singing voice and
songs over background music. A single unified checkpoint serves both offline and
real-time streaming transcription, with word-level timestamps available via the
companion Qwen3-ForcedAligner-0.6B; the Qwen team reports state-of-the-art quality among
open-source ASR models and accuracy competitive with commercial APIs. This OpenASR repo
repackages the original Qwen/Qwen3-ASR-1.7B weights as .oasr packs that run natively in
the OpenASR runtime with no Python at inference time. For most users the q8_0 build is the
recommended default (near-reference accuracy at roughly half the footprint), while q4_k
suits tight-memory deployments and fp16 is reserved for verification or maximum fidelity.
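The build recommendation above can be sketched as a small selection helper. It uses the RAM peak figures from the builds table as thresholds, which is an assumption: those peaks were measured on an M1 with an 11 s clip and may differ on other hardware or longer audio. `pick_build` is a hypothetical helper, and the tags match the `openasr pull` commands in the Quickstart. fp16 is never chosen automatically, since it is reserved for verification or maximum fidelity.

```shell
#!/bin/sh
# pick_build FREE_GB -> prints a pull target, or exits non-zero if nothing fits.
# Thresholds are the "RAM peak" figures from the builds table (M1, 11 s clip);
# treat them as rough guides rather than guarantees.
pick_build() {
  awk -v free="$1" 'BEGIN {
    if (free >= 5.05) {
      print "qwen3-asr-1.7b:q8"      # recommended default
    } else if (free >= 4.80) {
      print "qwen3-asr-1.7b:q4"      # tight-memory fallback
    } else {
      exit 1                         # below the smallest measured peak
    }
  }'
}

# Example: choose a build for a machine with 8 GB free and print the pull command.
if target=$(pick_build 8); then
  echo "openasr pull $target"
else
  echo "no build fits in the available memory" >&2
fi
```

Leaving some headroom above these thresholds is probably wise, because the operating system and other processes also need memory.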
How these packs were made
Converted from Qwen/Qwen3-ASR-1.7B with the OpenASR importer:
openasr model-pack import-qwen-local <src> <out>.oasr \
--package-id qwen3-asr-1.7b --quantization {fp16,q8-0,q4-k}
The .oasr container is GGUF-backed; packs use zero-copy mmap weight binding and graph
buffer reuse to keep peak memory low.
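The importer command above lists three quantization values in braces. As a sketch, the loop below expands that into one command per build and prints each command rather than running it. The flag spellings (`fp16`, `q8-0`, `q4-k`) come from the command above and the output file names follow the builds table, which uses underscores. `print_import_commands` and the `./Qwen3-ASR-1.7B` source path are placeholders introduced for illustration.

```shell
#!/bin/sh
# print_import_commands SRC: print one importer command per quantization.
# Flag values use hyphens (q8-0, q4-k); pack file names in the builds table
# use underscores (q8_0, q4_k), so the suffix is derived by substitution.
print_import_commands() {
  src=$1
  for quant in fp16 q8-0 q4-k; do
    suffix=$(printf '%s' "$quant" | sed 's/-/_/g')
    echo "openasr model-pack import-qwen-local $src qwen3-asr-1.7b-$suffix.oasr" \
         "--package-id qwen3-asr-1.7b --quantization $quant"
  done
}

# ./Qwen3-ASR-1.7B is a placeholder for a local copy of the upstream weights.
print_import_commands ./Qwen3-ASR-1.7B
```

Printing the commands first makes it easy to review them before piping the output to `sh` to run the conversions.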
License
These packs inherit the upstream model's license: Apache-2.0. OpenASR packaging retains the upstream copyright and NOTICE; the only modifications are format conversion and quantization.
Acknowledgements
This pack is a redistribution of Qwen3-ASR-1.7B, created and open-sourced by the Qwen team at Alibaba (Qwen/Qwen3-ASR-1.7B). All credit for the original architecture, training, and weights belongs to them; the license is inherited from and identical to the upstream model (Apache-2.0). Word-level timestamps additionally require the companion Qwen3-ForcedAligner-0.6B. The GGUF quantization recipe and bit-identity verification methodology were informed by cstr/qwen3-asr-1.7b-GGUF. Thank you to both teams for releasing their work openly.
Links
- OpenASR: https://github.com/QuintinShaw/openasr
- Website: https://openasr.org
- Upstream model: Qwen/Qwen3-ASR-1.7B