Speaker Embedding
#64
by
bertrand-fournel
- opened
Hi ! Is it possible de perform Speaker Embedding with Whisper ? For example, encode a few seconds of audio (a speaker) to a vector, encode a second audio file with another speaker and get the "distance" (cosine similarity for example) between two voices (or between voice of same speaker), thanks you (excuse my english).
use pyannote