Instructions to use Recogment/sortformer-4spk-v2-onnx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- NeMo
How to use Recogment/sortformer-4spk-v2-onnx with NeMo:
import nemo.collections.asr as nemo_asr asr_model = nemo_asr.models.ASRModel.from_pretrained("Recogment/sortformer-4spk-v2-onnx") transcriptions = asr_model.transcribe(["file.wav"]) - Notebooks
- Google Colab
- Kaggle
Sortformer 4-speaker streaming diarizer (ONNX) β Recogment mirror
Streaming Transformer-based speaker diarization, fixed at 4 simultaneous speakers per chunk. This repo packages ONNX exports of the upstream NVIDIA NeMo checkpoint so that runtimes which don't depend on PyTorch / NeMo can use the model directly.
β οΈ Unofficial. This is a community-maintained mirror of derived artefacts. For the canonical model, citation, and training details see nvidia/diar_streaming_sortformer_4spk-v2.
Files
| File | Purpose | Size |
|---|---|---|
diar_streaming_sortformer_4spk-v2.onnx |
Dynamic-axis ONNX (streaming, sub-chunk inference) | ~470 MB |
diar_streaming_sortformer_4spk-v2-static.onnx |
Static-axis ONNX (full-chunk inference) | ~480 MB |
Both files contain the same model weights β the difference is only in the ONNX graph's batch / time-axis declarations. The dynamic variant is what you use for sub-chunk streaming; the static variant gives onnxruntime more room to optimise when you have a fixed 30s window.
Provenance
These ONNX files were exported from the upstream NeMo .nemo checkpoint of nvidia/diar_streaming_sortformer_4spk-v2. No quantization, retraining, or fine-tuning β weights round-trip bit-for-bit through the export.
Why this mirror exists
The Recogment daemon ships with a pinned set of model SHA-256 hashes and a zero-egress sandbox; the only outbound network in the whole product is a companion downloader binary that fetches from public HTTPS sources. Until this mirror existed, the Sortformer files had to be shipped to beta testers out-of-band as a tarball. With this mirror in place, the downloader can fetch them automatically alongside the rest of the public catalogue.
If you're not building Recogment, you almost certainly want the official NVIDIA repo linked above instead β it includes the original .nemo checkpoint, configuration, and reference NeMo inference code.
License
This work is licensed under CC-BY-4.0, matching the upstream NVIDIA repo. Attribution: NVIDIA Corporation, via nvidia/diar_streaming_sortformer_4spk-v2.
Citation
Please cite the upstream model paper, not this mirror:
@misc{nvidia2025sortformerv2,
title={Streaming Sortformer Diarizer 4spk v2},
author={NVIDIA},
year={2025},
publisher={Hugging Face},
howpublished={\url{https://huggingface.co/nvidia/diar_streaming_sortformer_4spk-v2}}
}
Model tree for Recogment/sortformer-4spk-v2-onnx
Base model
nvidia/diar_streaming_sortformer_4spk-v2