Hervé Bredin committed
Commit • 89a2e1b
1 Parent(s): aaad0e6
feat: initial import

- README.md +57 -0
- config.yaml +18 -0
README.md
ADDED
@@ -0,0 +1,57 @@
---
tags:
- pyannote
- audio
- voice
- speech
- speaker
- speaker-diarization
- speaker-change-detection
- voice-activity-detection
- overlapped-speech-detection
datasets:
- ami
- dihard
- voxconverse
- voxceleb
license: mit
inference: false
---

+
# [pyannote.audio](https://github.com/pyannote/pyannote-audio) // speaker diarization
|
22 |
+
|
23 |
+
```python
|
24 |
+
from pyannote.audio import Pipeline
|
25 |
+
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization")
|
26 |
+
output = pipeline("audio.wav")
|
27 |
+
|
28 |
+
for speech_turn, _, speaker in output.itertracks():
|
29 |
+
print(f"Speaker '{speaker}' speaks between t={speech_turn.start}s and t={speech_turn.end}s.")
|
30 |
+
```
|
31 |
+
|
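Diarization results are commonly exchanged in the RTTM format, one `SPEAKER` line per speech turn. As a minimal sketch of that format (the `to_rttm` helper and the `turns` list below are illustrative stand-ins, not part of the pyannote.audio API):

```python
# Minimal sketch: format (start, end, speaker) speech turns as RTTM lines.
# `turns` stands in for the speech turns produced by the pipeline above;
# "audio" is a hypothetical recording identifier.

def to_rttm(turns, file_id="audio"):
    """Return one RTTM 'SPEAKER' line per speech turn."""
    lines = []
    for start, end, speaker in turns:
        duration = end - start
        lines.append(
            f"SPEAKER {file_id} 1 {start:.3f} {duration:.3f} "
            f"<NA> <NA> {speaker} <NA> <NA>"
        )
    return "\n".join(lines)

turns = [(0.0, 1.5, "SPEAKER_00"), (1.5, 3.2, "SPEAKER_01")]
print(to_rttm(turns))
```

Each line records the file id, channel, turn start and duration, and the speaker label; the `<NA>` fields are unused placeholders defined by the format.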
## Benchmark

| Dataset | [Diarization error rate](http://pyannote.github.io/pyannote-metrics/reference.html#diarization) |
| ------- | ----------------------------------------------------------------------------------------------- |
| [AMI `only_words` evaluation set](https://github.com/BUTSpeechFIT/AMI-diarization-setup) | 21.3% |
| [DIHARD 3 evaluation set](https://arxiv.org/abs/2012.01477) | 22.2% |
| [VoxConverse 0.0.2 evaluation set](https://github.com/joonson/voxconverse) | 13.0% |

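The diarization error rate linked above is the sum of false alarm, missed detection, and speaker confusion durations divided by the total duration of reference speech. A minimal sketch of that arithmetic (the durations below are made up for illustration, not taken from the benchmark):

```python
# Diarization error rate = (false alarm + missed detection + confusion) / total speech.
# All durations are in seconds; the values below are illustrative only.

def diarization_error_rate(false_alarm, missed_detection, confusion, total_speech):
    return (false_alarm + missed_detection + confusion) / total_speech

der = diarization_error_rate(12.0, 30.0, 21.6, 300.0)
print(f"{der:.1%}")  # → 21.2%
```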
## Support

For commercial enquiries and scientific consulting, please contact [me](mailto:herve@niderb.fr).
For [technical questions](https://github.com/pyannote/pyannote-audio/discussions) and [bug reports](https://github.com/pyannote/pyannote-audio/issues), please check the [pyannote.audio](https://github.com/pyannote/pyannote-audio) GitHub repository.

## Citation

```bibtex
@inproceedings{Bredin2020,
  Title = {{pyannote.audio: neural building blocks for speaker diarization}},
  Author = {{Bredin}, Herv{\'e} and {Yin}, Ruiqing and {Coria}, Juan Manuel and {Gelly}, Gregory and {Korshunov}, Pavel and {Lavechin}, Marvin and {Fustes}, Diego and {Titeux}, Hadrien and {Bouaziz}, Wassim and {Gill}, Marie-Philippe},
  Booktitle = {ICASSP 2020, IEEE International Conference on Acoustics, Speech, and Signal Processing},
  Address = {Barcelona, Spain},
  Month = {May},
  Year = {2020},
}
```
config.yaml
ADDED
@@ -0,0 +1,18 @@
pipeline:
  name: pyannote.audio.pipelines.SpeakerDiarization
  params:
    segmentation: pyannote/segmentation
    embedding: speechbrain/spkrec-ecapa-voxceleb
    clustering: AgglomerativeClustering

params:
  clustering:
    method: average
    threshold: 0.582398766878762
  min_activity: 6.073193238899291
  min_duration_off: 0.09791355693027545
  min_duration_on: 0.05537587440407595
  offset: 0.4806866463041527
  onset: 0.8104268538848918
  stitch_threshold: 0.04033955907446252
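The `onset` and `offset` values above are hysteresis thresholds applied to the frame-level segmentation scores: a region becomes active when the score rises above `onset` and stays active until it drops below `offset`. A minimal sketch of that logic (the `binarize` helper and the frame scores are illustrative; the real pipeline also applies the `min_duration_*` cleanup afterwards):

```python
# Hysteresis thresholding sketch: activate when score > onset,
# deactivate when score < offset. `scores` is a per-frame sequence;
# returns (start_index, end_index) pairs of active regions.

def binarize(scores, onset=0.8104268538848918, offset=0.4806866463041527):
    regions, start, active = [], None, False
    for i, score in enumerate(scores):
        if not active and score > onset:
            active, start = True, i
        elif active and score < offset:
            regions.append((start, i))
            active = False
    if active:
        regions.append((start, len(scores)))
    return regions

scores = [0.1, 0.9, 0.7, 0.6, 0.3, 0.2, 0.85, 0.5, 0.4]
print(binarize(scores))  # → [(1, 4), (6, 8)]
```

Using two thresholds instead of one avoids rapid on/off flickering when scores hover near a single cutoff.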