bofenghuang
/

asr-wav2vec2-xls-r-1b-ctc-french

Automatic Speech Recognition

hf-asr-leaderboard

robust-speech-event

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Fine-tuned Wav2Vec2 XLS-R 1B model for ASR in French

This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on French using the train and validation splits of Common Voice 11.0, Multilingual LibriSpeech, Voxpopuli, Multilingual TEDx, MediaSpeech, and African Accented French on 16kHz sampled speech audio. When using the model make sure that your speech input is also sampled at 16Khz.

Genrally we advise to use bofenghuang/asr-wav2vec2-ctc-french because it has the smaller model size and the better performance.

Downloads last month: 18

Inference Providers NEW

Automatic Speech Recognition

This model is not currently available via any of the supported Inference Providers.

Datasets used to train bofenghuang/asr-wav2vec2-xls-r-1b-ctc-french

Evaluation results

Test WER on Common Voice 11.0
self-reported

14.800
Test WER (+LM) on Common Voice 11.0
self-reported

12.610
Test WER on Multilingual LibriSpeech (MLS)
self-reported

9.390
Test WER (+LM) on Multilingual LibriSpeech (MLS)
self-reported

8.060
Test WER on VoxPopuli
self-reported

11.800
Test WER (+LM) on VoxPopuli
self-reported

9.940
Test WER on African Accented French
self-reported

22.980
Test WER (+LM) on African Accented French
self-reported

20.730
Test WER on Robust Speech Event - Dev Data
self-reported

17.880
Test WER (+LM) on Robust Speech Event - Dev Data
self-reported

14.010

View on Papers With Code