whisper identified the wrong language

#41

by lypspeech - opened Apr 26, 2023

Apr 26, 2023

When I follow the example of long-form transcription for whisper-large with Korean, the result is English. But after finetuning the whisper-large model with some Korean data, the checkpoint can output Korean. I also test other model size, but all the models output English.
I was confused about it. How should I do to output Korean with the original model?

liushaowei

May 9, 2023

me too

atulyaatul

Aug 31, 2023

you can try this:
pipe = pipeline(
"automatic-speech-recognition",
model="openai/whisper-large-v2",
generate_kwargs={"language": "br", "task": "transcribe"},
device="cpu",
use_fast=True
)

ArthurZ

Sep 4, 2023

You should read the doc about how to properly set the task for transcription instead of translation! As mentioned by @atulyaatul

ArthurZ changed discussion status to closed Sep 4, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment