File size: 816 Bytes
1339cff
 
 
 
 
 
 
 
 
 
 
156d9f6
1339cff
 
 
44736a7
 
534bc3a
44736a7
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
---
language: da
tags:
- speech
- xls_r
- xls_r_pretrained
- danish
license: apache-2.0
---
## XLS-R-300m-danish

Continued pretraining of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) for 120.000 steps on 141.000 hours of speech from Danish radio (DR P1 and Radio24Syv from 2005 to 2021). 

The model was pretrained on 16kHz audio using fairseq and should be fine-tuned to perform speech recognition.  

A fine-tuned version of this model for ASR can be found [here](https://huggingface.co/chcaa/xls-r-300m-danish-nst-cv9).

The model was trained by [Lasse Hansen](https://github.com/HLasse) ([CHCAA](https://chcaa.io)) and [Alvenir](https://alvenir.ai) on the [UCloud](https:/cloud.sdu.dk) platform. Many thanks to the Royal Danish Library for providing access to the data.