Whisper model finetuned using audio data from CommonVoice Ukrainian v10 train and dev set with additional data via semi-supervised data.

There is a differences in tokenization of source data (in our data normalization process, we replace punctucation with "" rather than Whisper's " "). This mismatch leads to a slight degradation on CommonVoice.

Downloads last month: 20

Inference Providers NEW

Automatic Speech Recognition

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Evaluation results

WER on mozilla-foundation/common_voice_11_0
test set self-reported

13.010

View on Papers With Code