Lemswasabi commited on
Commit
247af8e
1 Parent(s): 1078c9a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -2
README.md CHANGED
@@ -8,6 +8,26 @@ metrics:
8
  - wer
9
  pipeline_tag: automatic-speech-recognition
10
  license: mit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,7 +35,9 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  ## Model description
17
 
18
- More information needed
 
 
19
 
20
  ## Intended uses & limitations
21
 
@@ -47,4 +69,18 @@ The following hyperparameters were used during training:
47
  - Transformers 4.20.0.dev0
48
  - Pytorch 1.11.0+cu113
49
  - Datasets 2.2.1
50
- - Tokenizers 0.12.1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  - wer
9
  pipeline_tag: automatic-speech-recognition
10
  license: mit
11
+
12
+ model-index:
13
+ - name: Lemswasabi/wav2vec2-large-xlsr-53-842h-luxembourgish-14h-with-lm
14
+ results:
15
+ - task:
16
+ type: automatic-speech-recognition # Required. Example: automatic-speech-recognition
17
+ name: Speech Recognition # Optional. Example: Speech Recognition
18
+ metrics:
19
+ - type: wer
20
+ value: 11.68
21
+ name: Dev WER
22
+ - type: wer
23
+ value: 10.71
24
+ name: Test WER
25
+ - type: cer
26
+ value: 2.64
27
+ name: Dev CER
28
+ - type: cer
29
+ value: 2.31
30
+ name: Test CER
31
  ---
32
 
33
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
35
 
36
  ## Model description
37
 
38
+ We fine-tuned a wav2vec 2.0 large XLSR-53 checkpoint with 842h of unlabelled Luxembourgish speech
39
+ collected from [RTL.lu](https://www.rtl.lu/). Then the model was fine-tuned on 14h of labelled
40
+ Luxembourgish Speech from the same domain.
41
 
42
  ## Intended uses & limitations
43
 
 
69
  - Transformers 4.20.0.dev0
70
  - Pytorch 1.11.0+cu113
71
  - Datasets 2.2.1
72
+ - Tokenizers 0.12.1
73
+
74
+ ## Citation
75
+
76
+ This model is a result of our paper `IMPROVING LUXEMBOURGISH SPEECH RECOGNITION WITH CROSS-LINGUAL SPEECH REPRESENTATIONS` submitted to the [IEEE SLT 2022 workshop](https://slt2022.org/)
77
+
78
+ ```
79
+ @misc{lb-wav2vec2,
80
+ author = {Nguyen, Le Minh and Nayak, Shekhar and Coler, Matt.},
81
+ keywords = {Luxembourgish, multilingual speech recognition, language modelling, wav2vec 2.0 XLSR-53, under-resourced language},
82
+ title = {IMPROVING LUXEMBOURGISH SPEECH RECOGNITION WITH CROSS-LINGUAL SPEECH REPRESENTATIONS},
83
+ year = {2022},
84
+ copyright = {2023 IEEE}
85
+ }
86
+ ```