saattrupdan
/

wav2vec2-xls-r-300m-cv8-da

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

wav2vec2-xls-r-300m-cv8-da / README.md

saattrupdan's picture

Update README.md

547a971 about 2 years ago

|

raw history blame contribute delete

No virus

1.37 kB

	---
	language:
	- da
	license: apache-2.0
	tasks:
	- automatic-speech-recognition
	datasets:
	- common_voice_8_0
	metrics:
	- wer
	model-index:
	- name: wav2vec2-xls-r-300m-cv8-da
	results:
	- task:
	type: automatic-speech-recognition
	dataset:
	type: mozilla-foundation/common_voice_8_0
	args: da
	name: Danish Common Voice 8.0
	metrics:
	- type: wer
	value: 26.45
	- task:
	type: automatic-speech-recognition
	dataset:
	type: Alvenir/alvenir_asr_da_eval
	name: Alvenir ASR test dataset
	metrics:
	- type: wer
	value: 25.80
	---

	# XLS-R-300m-CV8-da

	## Model description

	This model is a fine-tuned version of the multilingual acoustic model [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the Danish part of [Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), containing ~6 crowdsourced hours of read-aloud Danish speech.


	## Performance

	The model achieves the following WER scores (lower is better):

	\| Dataset \| WER without LM \| WER with 5-gram LM \|
	\| :---: \| ---: \| ---: \|
	\| [Danish part of Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0/viewer/da/train) \| 31.33 \| 26.45 \|
	\| [Alvenir test set](https://huggingface.co/datasets/Alvenir/alvenir_asr_da_eval) \| 30.54 \| 25.80 \|