Apply for community grant: Personal project

#1
by deepdml - opened

Hi @deepdml - Thank you for applying for a community grant! Can you please ping us your model's WER and how it compares with Whisper medium/ large-v2 in zero-shot performance?

In all cases, this fine-tuned model's WER is better than the medium and large Spanish models, except on the FLEURS dataset (I'm analyzing now why it is worse in that case):

  • facebook/multilingual_librispeech: 4.66 % WER
  • mozilla-foundation/common_voice_11_0: 6.34 % WER
  • facebook/voxpopuli: 8.37 % WER
  • google/fleurs: 4.03 % WER

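For reference, the WER numbers above are word error rates: the word-level edit distance between the reference transcript and the model's hypothesis, divided by the number of reference words. Evaluation scripts for these leaderboards typically compute this with a library such as jiwer, but a minimal pure-Python sketch of the metric itself looks like this:

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    # Rolling-row dynamic programming table for edit distance over words.
    row = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        prev = row[0]          # value of row[i-1][j-1]
        row[0] = i
        for j, h in enumerate(hyp, 1):
            cur = row[j]       # value of row[i-1][j]
            row[j] = min(
                cur + 1,               # deletion
                row[j - 1] + 1,        # insertion
                prev + (r != h),       # substitution (0 cost if words match)
            )
            prev = cur
    return row[-1] / len(ref)

print(wer("hola que tal", "hola tal"))  # one deleted word out of three → 0.333…
```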
I've been comparing against the best medium Spanish model on the leaderboard, whisper-medium-es, and it looks like fine-tuning only on common_voice_11_0 produces overfitting. Couldn't the leaderboard use the average WER over different test datasets instead?
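Concretely, using the numbers reported above, an unweighted macro-average across test sets (one hypothetical way the leaderboard could rank models) would look like:

```python
# WER per test dataset, as reported above (percent).
wer_by_dataset = {
    "facebook/multilingual_librispeech": 4.66,
    "mozilla-foundation/common_voice_11_0": 6.34,
    "facebook/voxpopuli": 8.37,
    "google/fleurs": 4.03,
}

# Unweighted macro-average: every dataset counts equally, so a model
# overfit to one test set can't dominate the ranking.
avg_wer = sum(wer_by_dataset.values()) / len(wer_by_dataset)
print(f"Average WER: {avg_wer:.2f} %")  # → Average WER: 5.85 %
```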

