File size: 2,341 Bytes
740d8bb
 
 
551fed2
740d8bb
0511591
740d8bb
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d7035dc
740d8bb
d3be907
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2eda451
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92

---
language: gl
license: apache-2.0
datasets:
- openslr
metrics:
- wer
- cer
tags:
- audio
- automatic-speech-recognition
- gl 
model-index:
- name: Wav2Vec2-Large-XLSR-53-Galician-With-LM
  results:
  - task: 
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: OpenSLR
      type: openslr
      args: gl
    metrics:
       - name: Test WER
         type: wer
         value: 9.10
       - name: Test CER
         type: cer
         value: 3.94
       - name: Test WER (+LM)
         type: wer
         value: 6.86
       - name: Test CER (+LM)
         type: cer
         value: 2.20
  - task:
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: Common Voice 7.0
      type: mozilla-foundation/common_voice_7_0
      args: gl
    metrics:
       - name: Test WER
         type: wer
         value: 22.12
       - name: Test CER
         type: cer
         value: 5.09
       - name: Test WER (+LM)
         type: wer
         value: 15.20
       - name: Test CER (+LM)
         type: cer
         value: 3.87


---

## Wav2Vec2-Large-XLSR-53-Galician-With-LM

This is a copy of the model [diego-fustes/wav2vec2-large-xlsr-gl](https://huggingface.co/diego-fustes/wav2vec2-large-xlsr-gl) with an integrated language model.


**Improvement**
This model has been compared with the baseline (diego-fustes/wav2vec2-large-xlsr-gl) on:

- The test subset of the Galician OpenSLR dataset (837 speech samples)
- The test subset of the Galician Common Voice 7.0 dataset (1716 speech samples)

The results are shown in the following tables:

OpenSLR77:
| Model | WER | CER |
| ------------- | ------------- | ------------- |
| diego-fustes/wav2vec2-large-xlsr-gl | **9.10%** | **3.94%** |
| cmagui/wav2vec2-large-xlsr-53-galician-with-lm | **6.86%** | **2.20%** |

Common_voice-gl:
| Model | WER | CER |
| ------------- | ------------- | ------------- |
| diego-fustes/wav2vec2-large-xlsr-gl | **22.12%** | **5.09%** |
| cmagui/wav2vec2-large-xlsr-53-galician-with-lm | **15.20%** | **3.87%** |


**Citation information**

If you use this model, please cite as follows:

Magariños, Carmen; Gamallo, Pablo. 2023. Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm. URL: https://huggingface.co/proxectonos/Nos_ASR-wav2vec2-large-xlsr-53-gl-with-lm