Could you please suggest how to fine-tune the model on top of Swahili

Opened by nathanhunt

I am trying to understand how to fine-tune the Whisper model for other languages. However, the WhisperTokenizer doesn't support some languages (like Kinyarwanda). I see that you fine-tuned it on top of Swahili. Could you please suggest how to train it like this?

Mbaza NLP org

Hi, I can suggest two options. The first is to pick a host language (e.g. Swahili in this case) and train the target language on top of it. The second option is similar to the first, but you also train a BPE tokenizer on the target language, add the resulting tokens to Whisper's tokenizer, and then train the model, as in the sketch below.
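For the second option, a minimal sketch might look like the following. The corpus file name, vocabulary size, and the `openai/whisper-small` checkpoint are placeholders I'm assuming for illustration, not part of the original answer:

```python
from tokenizers import Tokenizer, models, pre_tokenizers, trainers
from transformers import WhisperForConditionalGeneration, WhisperTokenizer

# Train a byte-level BPE tokenizer on the target-language text corpus.
# "kinyarwanda_corpus.txt" is a placeholder: one sentence per line.
bpe = Tokenizer(models.BPE())
bpe.pre_tokenizer = pre_tokenizers.ByteLevel(add_prefix_space=False)
trainer = trainers.BpeTrainer(vocab_size=8000)  # vocab size is an assumption
bpe.train(files=["kinyarwanda_corpus.txt"], trainer=trainer)

# Load Whisper's tokenizer configured for the host language (Swahili here).
tokenizer = WhisperTokenizer.from_pretrained(
    "openai/whisper-small", language="swahili", task="transcribe"
)

# Add only the tokens Whisper doesn't already know.
new_tokens = [t for t in bpe.get_vocab() if t not in tokenizer.get_vocab()]
tokenizer.add_tokens(new_tokens)

# Resize the model's embedding matrix so the new tokens have trainable rows,
# then fine-tune as usual (e.g. with Seq2SeqTrainer) on the target data.
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-small")
model.resize_token_embeddings(len(tokenizer))
```

For the first option you would skip the tokenizer training entirely: keep the Swahili tokenizer settings and simply fine-tune on the target-language audio/text pairs.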

Thank you very much. I quickly tried training the model with the first option. It works fine on the target language, but it can no longer transcribe other languages. For example, when I try to transcribe Thai audio, the output is always in the target language (not Thai). Is this expected for a fine-tuned model?

Sorry for the late reply. After fine-tuning, the model should still be able to transcribe other languages (see the inference sketch below).
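One thing worth checking (my assumption, not stated in the thread): whether the language token is being forced at generation time, since otherwise Whisper auto-detects the language and a fine-tuned model may default to the target language. A sketch, where the checkpoint path and audio file are placeholders:

```python
import librosa
from transformers import WhisperForConditionalGeneration, WhisperProcessor

# "path/to/finetuned-checkpoint" is a placeholder for your saved model.
processor = WhisperProcessor.from_pretrained("path/to/finetuned-checkpoint")
model = WhisperForConditionalGeneration.from_pretrained("path/to/finetuned-checkpoint")

# Load 16 kHz audio; "thai_sample.wav" is a placeholder file.
speech, _ = librosa.load("thai_sample.wav", sr=16000)
inputs = processor(speech, sampling_rate=16000, return_tensors="pt")

# Force Thai instead of letting the model auto-detect the language.
forced_ids = processor.get_decoder_prompt_ids(language="thai", task="transcribe")
predicted_ids = model.generate(inputs.input_features, forced_decoder_ids=forced_ids)
print(processor.batch_decode(predicted_ids, skip_special_tokens=True)[0])
```

Even with the language forced, some quality loss on non-target languages is possible if the fine-tuning data was monolingual.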
