AndrewMcDowell committed on
Commit
b4be586
1 Parent(s): 6767044

Update README.md

Add eval metrics.

Files changed (1)
  1. README.md +27 -2
README.md CHANGED
@@ -6,11 +6,27 @@ tags:
  - automatic-speech-recognition
  - mozilla-foundation/common_voice_8_0
  - generated_from_trainer
+ - robust-speech-event
+ - ja
  datasets:
  - common_voice
  model-index:
- - name: ''
-   results: []
+ - name: 'XLS-R-300-m'
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Common Voice 8
+       type: mozilla-foundation/common_voice_8_0
+       args: ja
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 94.91
+     - name: Test CER
+       type: cer
+       value: 23.32
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -23,6 +39,8 @@ It achieves the following results on the evaluation set:
  - Loss: 0.5351
  - Wer: 2.6188

+ Kanji are converted into Hiragana using the [pykakasi](https://pykakasi.readthedocs.io/en/latest/index.html) library during training and evaluation. The model can output both Hiragana and Katakana characters.
+
  ## Model description

  More information needed
@@ -73,3 +91,10 @@ The following hyperparameters were used during training:
  - Pytorch 1.10.2+cu102
  - Datasets 1.18.2.dev0
  - Tokenizers 0.11.0
+
+ #### Evaluation Commands
+ 1. To evaluate on `mozilla-foundation/common_voice_8_0` with split `test`
+
+ ```bash
+ python ./eval.py --model_id AndrewMcDowell/wav2vec2-xls-r-300m-japanese --dataset mozilla-foundation/common_voice_8_0 --config ja --split test --log_outputs
+ ```
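
The commit adds both a Test WER and a Test CER entry to the model-index metadata. As a rough illustration of how those two metrics differ, here is a minimal pure-Python sketch of word error rate and character error rate built on Levenshtein distance. It is not the `eval.py` referenced in the evaluation command, and the function names (`edit_distance`, `wer`, `cer`) and the sample strings are hypothetical. It also assumes that spaces count as characters in CER, which may not match the actual evaluation setup.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists of tokens)."""
    prev = list(range(len(hyp) + 1))  # distance from an empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # substitute r with h, or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance over reference word count."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance over reference length.

    Assumption: whitespace is counted as an ordinary character.
    """
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Hypothetical Hiragana strings that differ only in where the spaces fall.
    reference = "きょうは いい てんき です"
    hypothesis = "きょうはいい てんきです"
    print("WER:", wer(reference, hypothesis))
    print("CER:", cer(reference, hypothesis))
```

In this example the two strings contain the same Hiragana and differ only in spacing, yet the WER is 1.0 because no whitespace-delimited token matches, while the CER is 2/14 (two deleted spaces out of fourteen reference characters). This suggests one way a high WER can coexist with a much lower CER for Japanese text, though it is not a confirmed explanation of the figures reported in this commit.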