Update README.md

#126
by reach-vb HF staff - opened

Replicating the model cards for Distil-whisper-large-v3 over to whisper-large-v3

Thanks for bringing these changes over from distil-large-v3 and updating accordingly! Unlike distil-whisper, large-v3 is multilingual, and we frequently get asked how to set the language/task args. It would be great to add a note on forcing or automatically detecting the source audio language, and on switching between transcribe and translate.

(nit) we can also install Transformers from the latest release rather than from main. I'll update this for distil-large-v3 now as well.

GG! It does already mention this in the Short Transcription section:

The above arguments can be used in isolation or in combination. For example, to perform the task of speech translation where the source audio is in French, and we want to return sentence-level timestamps, the following can be used:

result = pipe(sample, return_timestamps=True, generate_kwargs={"language": "french", "task": "translate"})
print(result["chunks"])

Let me know if this doesn't make sense (this was already in the model card).
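
For anyone landing here with the same language/task question, here is a minimal sketch of the three common cases: automatic language detection, a forced source language, and translation. The helper names `build_generate_kwargs` and `run_whisper` are hypothetical and not part of Transformers. The sketch assumes `pipe` is the automatic-speech-recognition pipeline built as in the model card, and that leaving out `language` lets Whisper predict it from the audio.

```python
from typing import Any, Callable, Dict, Optional

# The two tasks discussed in this thread.
VALID_TASKS = ("transcribe", "translate")


def build_generate_kwargs(
    language: Optional[str] = None, task: Optional[str] = None
) -> Dict[str, str]:
    """Hypothetical helper: collect only the arguments the caller set.

    Omitting `language` leaves language detection to the model;
    omitting `task` leaves the model's default task in place.
    """
    if task is not None and task not in VALID_TASKS:
        raise ValueError(f"task must be one of {VALID_TASKS}, got {task!r}")
    kwargs: Dict[str, str] = {}
    if language is not None:
        kwargs["language"] = language
    if task is not None:
        kwargs["task"] = task
    return kwargs


def run_whisper(
    pipe: Callable[..., Any],
    sample: Any,
    *,
    language: Optional[str] = None,
    task: Optional[str] = None,
    return_timestamps: bool = False,
) -> Any:
    """Hypothetical wrapper: call the pipeline the same way the model card does."""
    return pipe(
        sample,
        return_timestamps=return_timestamps,
        generate_kwargs=build_generate_kwargs(language, task),
    )


# Usage, assuming `pipe` and `sample` are set up as in the model card:
#   run_whisper(pipe, sample)                                      # detect language
#   run_whisper(pipe, sample, language="french")                   # force French
#   run_whisper(pipe, sample, language="french", task="translate") # French -> English
```

Building the dictionary from only the arguments that were set keeps the auto-detect case identical to calling `pipe(sample)` with no language or task at all.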

Perfect, thanks @reach-vb 🙌

sanchit-gandhi changed pull request status to merged
