--- tags: - pyannote - speaker-diarization datasets: - ami - voxconverse license: mit --- # Speaker diarization Relies on pyannote.audio 2.0 currently in development: see [installation instructions](https://github.com/pyannote/pyannote-audio/tree/develop#installation). ```python from pyannote.audio import Pipeline pipeline = Pipeline.from_pretrained("AMITKESARI2000/pyannote_SD1") output = pipeline("audio.wav") for turn, _, speaker in output.itertracks(yield_label=True): # speaker speaks between turn.start and turn.end ... ``` ## Benchmark | Dataset | [Diarization error rate](http://pyannote.github.io/pyannote-metrics/reference.html#diarization) | | --------------------------------------------------------------------------------------------------- | ------ | | [AMI `only_words` evaluation set](https://github.com/BUTSpeechFIT/AMI-diarization-setup) | 21.4% |