saattrupdan
/

voxpopuli-wav2vec2-large-cv8-da

Automatic Speech Recognition

Inference Endpoints

Model card Files Files and versions Community

voxpopuli-wav2vec2-large-cv8-da / README.md

saattrupdan's picture

Update README.md

fddf4d0 over 2 years ago

|

history blame contribute delete

1.41 kB

	---
	language:
	- da
	license: cc-by-nc-4.0
	tasks:
	- automatic-speech-recognition
	datasets:
	- common_voice_8_0
	metrics:
	- wer
	model-index:
	- name: voxpopuli-wav2vec2-large-cv8-da
	results:
	- task:
	type: automatic-speech-recognition
	dataset:
	type: mozilla-foundation/common_voice_8_0
	args: da
	name: Danish Common Voice 8.0
	metrics:
	- type: wer
	value: 40.54
	- task:
	type: automatic-speech-recognition
	dataset:
	type: Alvenir/alvenir_asr_da_eval
	name: Alvenir ASR test dataset
	metrics:
	- type: wer
	value: 40.66
	---

	# VoxPopuli-Wav2vec2-large-CV8-da

	## Model description

	This model is a fine-tuned version of the Swedish acoustic model [facebook/wav2vec2-large-sv-voxpopuli](https://huggingface.co/facebook/wav2vec2-large-sv-voxpopuli) on the Danish part of [Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), containing ~6 crowdsourced hours of read-aloud Danish speech.


	## Performance

	The model achieves the following WER scores (lower is better):

	\| Dataset \| WER without LM \| WER with 5-gram LM \|
	\| :---: \| ---: \| ---: \|
	\| [Danish part of Common Voice 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0/viewer/da/train) \| 48.04 \| 40.54 \|
	\| [Alvenir test set](https://huggingface.co/datasets/Alvenir/alvenir_asr_da_eval) \| 48.43 \| 40.66 \|