Speaker Embedding

#64
by bertrand-fournel - opened

Hi! Is it possible to perform speaker embedding with Whisper? For example, encode a few seconds of audio from one speaker into a vector, encode a second audio file from another speaker, and get the "distance" (cosine similarity, for example) between the two voices (or between two recordings of the same speaker). Thank you (and excuse my English).

Use pyannote.
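
For the comparison step mentioned in the question, here is a minimal sketch. It assumes the embeddings have already been extracted by some speaker-embedding model (pyannote, as suggested above, or any other) and are available as plain lists of floats. The function name `cosine_similarity` and the three short vectors are made-up placeholders for illustration, not real model output; real embeddings typically have a few hundred dimensions.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equal-length vectors (range -1 to 1)."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Placeholder embeddings (hypothetical values, NOT produced by any model).
speaker_a_clip1 = [0.9, 0.1, 0.3]
speaker_a_clip2 = [0.8, 0.2, 0.35]
speaker_b_clip1 = [-0.2, 0.9, 0.1]

same_speaker = cosine_similarity(speaker_a_clip1, speaker_a_clip2)
different_speaker = cosine_similarity(speaker_a_clip1, speaker_b_clip1)

print(f"same speaker:      {same_speaker:.3f}")
print(f"different speaker: {different_speaker:.3f}")
```

A higher value means the two voices are closer; if a distance is preferred, `1 - cosine_similarity(a, b)` is the usual cosine distance. The threshold for deciding "same speaker" depends on the embedding model and has to be tuned on your own data.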
