geninhu commited on
Commit
4fb6aa5
1 Parent(s): 48f71da

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -16
README.md CHANGED
@@ -1,35 +1,55 @@
1
  ---
2
  license: apache-2.0
 
 
3
  tags:
4
  - automatic-speech-recognition
5
- - geninhu/fpt-vi
6
- - generated_from_trainer
7
  model-index:
8
  - name: xls-asr-vi-40h-1B
9
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
- should probably proofread and complete it, then remove this comment. -->
14
 
15
  # xls-asr-vi-40h-1B
16
 
17
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the GENINHU/FPT-VI - NA dataset.
18
- It achieves the following results on the evaluation set:
19
- - Loss: 4.1691
20
- - Wer: 0.4133
21
 
22
- ## Model description
 
 
 
 
23
 
24
- More information needed
 
 
 
 
25
 
26
- ## Intended uses & limitations
27
 
28
- More information needed
29
 
30
- ## Training and evaluation data
31
-
32
- More information needed
33
 
34
  ## Training procedure
35
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - vi
5
  tags:
6
  - automatic-speech-recognition
7
+ - robust-speech-event
8
+ - common-voice
9
  model-index:
10
  - name: xls-asr-vi-40h-1B
11
+ results:
12
+ - task:
13
+ name: Speech Recognition
14
+ type: automatic-speech-recognition
15
+ dataset:
16
+ name: Common Voice 7.0
17
+ type: mozilla-foundation/common_voice_7_0
18
+
19
+ args: vi
20
+ metrics:
21
+ - name: Test WER
22
+ type: wer
23
+ value: 34.210
24
+ - name: Test CER
25
+ type: cer
26
+ value: 19.938
27
  ---
28
 
 
 
29
 
30
  # xls-asr-vi-40h-1B
31
 
32
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the 40 hours of Vietnamese ASR data, including common_voice 7.0 and private dataset.
 
 
 
33
 
34
+ ### Benchmark WER result:
35
+ | | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
36
+ |---|---|---|
37
+ |without LM| 25.93 | 34.21 |
38
+ |with 4-grams LM| | |
39
 
40
+ ### Benchmark CER result:
41
+ | | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
42
+ |---|---|---|
43
+ |without LM| 9.243 | 19.938 |
44
+ |with 4-grams LM| | |
45
 
46
+ ## Evaluation
47
 
48
+ Please use eval.py file to run evaluation
49
 
50
+ ```python
51
+ python eval_custom.py --model_id geninhu/xls-asr-vi-40h-1B --dataset mozilla-foundation/common_voice_7_0 --config vi --split test --log_outputs
52
+ ```
53
 
54
  ## Training procedure
55