mozilla-ai
/

whisper-large-v3-turbo-bn

Automatic Speech Recognition

Model card Files Files and versions Metrics Training metrics Community

Finetuned openai/whisper-large-v3-turbo on 21409 Bengali training audio samples from cv-corpus-21.0-2025-03-14/bn.

This model was created from the Mozilla.ai Blueprint: speech-to-text-finetune.

Evaluation results on 9363 audio samples of Bengali:

Baseline model (before finetuning) on Bengali

Word Error Rate (Normalized): 78.843
Word Error Rate (Orthographic): 107.027
Character Error Rate (Normalized): 62.521
Character Error Rate (Orthographic): 72.012
Loss: 1.074

Finetuned model (after finetuning) on Bengali

Word Error Rate (Normalized): 11.053
Word Error Rate (Orthographic): 26.436
Character Error Rate (Normalized): 6.059
Character Error Rate (Orthographic): 7.537
Loss: 0.109

Downloads last month: 4

Safetensors

Model size

809M params

Tensor type

F32

·

Inference Providers NEW

Automatic Speech Recognition

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mozilla-ai/whisper-large-v3-turbo-bn

Base model

openai/whisper-large-v3

Finetuned

openai/whisper-large-v3-turbo

Finetuned

(330)

this model

Collection including mozilla-ai/whisper-large-v3-turbo-bn

Common Voice Whisper

Whisper models finetuned on the Common Voice dataset using https://github.com/mozilla-ai/speech-to-text-finetune • 9 items • Updated Jul 8 • 2

Evaluation results

wer on Common Voice (Bengali)
self-reported

11.053

View on Papers With Code