WeSpeaker ResNet34 Speaker Embedder (VoxCeleb) Β· OpenASR
WeSpeaker ResNet34 speaker embedder for OpenASR diarization β stronger speaker separation, fully on-device
Speaker-diarization support pack for the OpenASR runtime β pure-Rust inference, no Python at inference time.
β¨ Highlights
- π£οΈ Default OpenASR speaker embedder β install this pack and
--diarizelabels anonymous speakers for any ASR family - 𧬠256-dim ResNet34 embeddings β pyannote's WeSpeaker VoxCeleb model with temporal statistics pooling
- π Diarization, not identification β anonymous session-relative labels; embeddings stay local and are discarded after the request unless you explicitly enroll a local profile
- π― Parity-gated packaging β single raw-f32 build; the Rust forward pass matches the upstream pyannote reference at cosine β₯ 0.9999
- π¦ Native in OpenASR β
.oasrpacks run with no Python at inference, engineered for peak performance on CPU & GPU
π Quickstart
# 1. Install the OpenASR CLI Β· https://openasr.org
# 2. Pull the pack
openasr pull wespeaker-voxceleb-resnet34-lm:f32
# 3. Diarize any transcription (works with every OpenASR ASR model)
openasr transcribe meeting.wav --model xasr-zh-en --diarize --format srt
π¦ Pack
| Quant | File (.oasr) |
Size |
|---|---|---|
| f32 | wespeaker-voxceleb-resnet34-lm-f32.oasr |
27 MB |
Single raw-f32 build: the pure-Rust forward pass consumes f32 directly and the parity gates assert bit-exact outputs vs the upstream weights, so no integer quantization is produced.
π§ About WeSpeaker ResNet34 Speaker Embedder (VoxCeleb)
WeSpeaker ResNet34 is the speaker-embedding model used by pyannote for VoxCeleb speaker
representations. OpenASR packages it as a local .oasr capability pack and uses it as the default
speaker-embedding stage for diarization: speech regions are embedded, clustered, and attributed
back onto transcripts from any ASR model. CAM++ remains available as a compatibility pack and as the
fast realtime change-detector embedder when both packs are installed.
βοΈ How this pack was made
Converted from pyannote/wespeaker-voxceleb-resnet34-LM with the OpenASR importer:
openasr model-pack import-wespeaker-local <src>.safetensors <out>.oasr \
--package-id wespeaker-voxceleb-resnet34-lm
The .oasr container is GGUF-backed; every tensor is stored as raw f32 so the
pack round-trips bit-identically against the source weights.
βοΈ License
This pack inherits the upstream model's license: CC-BY-4.0 (source). OpenASR packaging retains the upstream copyright; the only modification is format conversion.
π Acknowledgements
This pack redistributes pyannote/wespeaker-voxceleb-resnet34-LM in OpenASR's .oasr runtime
format. Credit for the model architecture, training, and original weights belongs to the upstream
pyannote/WeSpeaker authors. The upstream model is licensed under CC-BY-4.0; OpenASR packaging
retains that license and attribution, with the only modification being format conversion for local
runtime loading.
π Links
- π¦ OpenASR β https://github.com/QuintinShaw/OpenASR
- π Website β https://openasr.org
- π€ Upstream model β pyannote/wespeaker-voxceleb-resnet34-LM
Model tree for OpenASR/wespeaker-voxceleb-resnet34-lm
Base model
pyannote/wespeaker-voxceleb-resnet34-LM