Specific Dialect Speech

by laylabitar - opened


I am aiming to fine-tune Whisper on the Jordanian dialect. Would I still use the Arabic tokenizer?

Also, do you have any advice on how to collect voice recordings for training data?

I hope to take this a step further and transcribe mixed-language speech (English and Arabic).

I would appreciate any advice.


laylabitar changed discussion status to closed
