vutankiet2901 commited on
Commit
a554061
1 Parent(s): 449e4ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -26
README.md CHANGED
@@ -1,38 +1,78 @@
1
  ---
 
2
  language:
3
  - ja
4
- license: apache-2.0
5
  tags:
6
  - automatic-speech-recognition
7
- - mozilla-foundation/common_voice_8_0
8
- - generated_from_trainer
 
9
  model-index:
10
- - name: wav2vec2-xls-r-1b-ja
11
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  ---
13
-
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
-
17
- # wav2vec2-xls-r-1b-ja
18
-
19
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - JA dataset.
20
- It achieves the following results on the evaluation set:
21
- - Loss: 0.3316
22
- - Wer: 0.2564
23
- - Cer: 0.1218
24
-
25
  ## Model description
26
 
27
- More information needed
28
-
29
- ## Intended uses & limitations
30
-
31
- More information needed
32
-
33
- ## Training and evaluation data
34
-
35
- More information needed
 
 
 
 
 
 
 
 
 
36
 
37
  ## Training procedure
38
 
 
1
  ---
2
+ license: apache-2.0
3
  language:
4
  - ja
 
5
  tags:
6
  - automatic-speech-recognition
7
+ - robust-speech-event
8
+ - common-voice
9
+ - ja
10
  model-index:
11
+ - name: wav2vec2-xls-r-1b
12
+ results:
13
+ - task:
14
+ name: Speech Recognition
15
+ type: automatic-speech-recognition
16
+ dataset:
17
+ name: Common Voice 7.0
18
+ type: mozilla-foundation/common_voice_7_0
19
+ args: ja
20
+ metrics:
21
+ - name: Test WER (with LM)
22
+ type: wer
23
+ value: 11.77
24
+ - name: Test CER (with LM)
25
+ type: cer
26
+ value: 5.22
27
+ - task:
28
+ name: Speech Recognition
29
+ type: automatic-speech-recognition
30
+ dataset:
31
+ name: Common Voice 8.0
32
+ type: mozilla-foundation/common_voice_8_0
33
+ args: ja
34
+ metrics:
35
+ - name: Test WER (with LM)
36
+ type: wer
37
+ value: 12.23
38
+ - name: Test CER (with LM)
39
+ type: cer
40
+ value: 5.33
41
+ - task:
42
+ name: Speech Recognition
43
+ type: automatic-speech-recognition
44
+ dataset:
45
+ name: Robust Speech Event - Dev Data
46
+ type: speech-recognition-community-v2/dev_data
47
+ args: ja
48
+ metrics:
49
+ - name: Test WER (with LM)
50
+ type: wer
51
+ value: 29.35
52
+ - name: Test CER (with LM)
53
+ type: cer
54
+ value: 16.43
55
  ---
 
 
 
 
 
 
 
 
 
 
 
 
56
  ## Model description
57
 
58
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - JA
59
+
60
+ ### Benchmark WER result:
61
+ | | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) | [COMMON VOICE 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0)
62
+ |---|---|---|
63
+ |without LM| 16.97 | 17.95 |
64
+ |with 4-grams LM| 11.77 | 12.23|
65
+ ### Benchmark CER result:
66
+ | | [COMMON VOICE 7.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) | [COMMON VOICE 8.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0)
67
+ |---|---|---|
68
+ |without LM| 6.82 | 7.05 |
69
+ |with 4-grams LM| 5.22 | 5.33 |
70
+ ## Evaluation
71
+ Please use the eval.py file to run the evaluation:
72
+ ```python
73
+ pip install mecab-python3 unidic-lite pykakasi
74
+ python eval.py --model_id vutankiet2901/wav2vec2-xls-r-1b-ja --dataset mozilla-foundation/common_voice_7_0 --config ja --split test --log_outputs
75
+ ```
76
 
77
  ## Training procedure
78