comodoro committed on
Commit
38832ec
1 Parent(s): 74c4791

Update README.md

Files changed (1)
  1. README.md +37 -7
README.md CHANGED
@@ -23,10 +23,10 @@ model-index:
23
  metrics:
24
  - name: Test WER
25
  type: wer
26
- value: 47.46
27
  - name: Test CER
28
- type: cer
29
- value: 10.88
30
  ---
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
32
  should probably proofread and complete it, then remove this comment. -->
@@ -35,8 +35,9 @@ should probably proofread and complete it, then remove this comment. -->
35
 
36
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice 8.0 dataset.
37
  It achieves the following results on the evaluation set:
38
- - WER: 0.47455377483706096
39
- - CER: 0.10877155235645618
 
40
 
41
  ## Model description
42
 
@@ -80,7 +81,10 @@ print("Reference:", test_dataset[:2]["sentence"])
80
 
81
  ## Evaluation
82
 
83
- The model can be evaluated using the attached `eval.py` script.
84
 
85
  ## Training and evaluation data
86
 
@@ -90,7 +94,8 @@ The Common Voice 8.0 `train` and `validation` datasets were used for training
90
 
91
  ### Training hyperparameters
92
 
93
- The following hyperparameters were used during training:
 
94
  - learning_rate: 7e-05
95
  - train_batch_size: 32
96
  - eval_batch_size: 8
@@ -103,6 +108,20 @@ The following hyperparameters were used during training:
103
  - num_epochs: 150
104
  - mixed_precision_training: Native AMP
105
 
106
  ### Training results
107
 
108
  | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
@@ -126,6 +145,17 @@ The following hyperparameters were used during training:
126
  | 0.0527 | 137.09 | 4250 | 0.6652 | 0.4749 | 0.1090 |
127
  | 0.0506 | 145.16 | 4500 | 0.6958 | 0.4846 | 0.1133 |
128
 
129
 
130
  ### Framework versions
131
 
 
23
  metrics:
24
  - name: Test WER
25
  type: wer
26
+ value: 16.1
27
  - name: Test CER
28
+ type: cer
29
+ value: 3.8
30
  ---
31
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
32
  should probably proofread and complete it, then remove this comment. -->
 
35
 
36
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice 8.0 dataset.
37
  It achieves the following results on the evaluation set:
38
+ - Loss: 0.2327
39
+ - Wer: 0.1608
40
+ - Cer: 0.0376
41
 
42
  ## Model description
43
 
 
81
 
82
  ## Evaluation
83
 
84
+ The model can be evaluated using the attached `eval.py` script:
85
+ ```
86
+ python eval.py --model_id comodoro/wav2vec2-xls-r-300m-cs-cv8 --dataset mozilla-foundation/common_voice_8_0 --split test --config cs
87
+ ```
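For readers unfamiliar with the two metrics, here is a minimal pure-Python sketch of how WER and CER are commonly defined: total edit distance over words (or characters) divided by the reference length. It is not taken from `eval.py`, whose normalization and scoring details are not shown here. The helper names and the choice to count spaces as characters in CER are assumptions made for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(references, hypotheses):
    """Word error rate: word-level edits divided by the number of reference words."""
    errors = sum(edit_distance(r.split(), h.split())
                 for r, h in zip(references, hypotheses))
    total = sum(len(r.split()) for r in references)
    return errors / total


def cer(references, hypotheses):
    """Character error rate; spaces count as characters (an assumption of this sketch)."""
    errors = sum(edit_distance(list(r), list(h))
                 for r, h in zip(references, hypotheses))
    total = sum(len(r) for r in references)
    return errors / total


def as_percent(fraction):
    """Express a fractional error rate as a percentage rounded to one decimal."""
    return round(100 * fraction, 1)


if __name__ == "__main__":
    refs = ["dobrý den"]   # hypothetical reference transcript
    hyps = ["dobry den"]   # hypothetical model output with one wrong character
    print("WER:", wer(refs, hyps))
    print("CER:", cer(refs, hyps))
    print("0.1608 as percent:", as_percent(0.1608))
```

On this scale, fractional values such as 0.1608 and 0.0376 correspond to 16.1 and 3.8 when written as percentages.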
88
 
89
  ## Training and evaluation data
90
 
 
94
 
95
  ### Training hyperparameters
96
 
97
+ The following hyperparameters were used during the first stage of training:
98
+
99
  - learning_rate: 7e-05
100
  - train_batch_size: 32
101
  - eval_batch_size: 8
 
108
  - num_epochs: 150
109
  - mixed_precision_training: Native AMP
110
 
111
+ The following hyperparameters were used during the second stage of training:
112
+
113
+ - learning_rate: 0.001
114
+ - train_batch_size: 32
115
+ - eval_batch_size: 8
116
+ - seed: 42
117
+ - gradient_accumulation_steps: 20
118
+ - total_train_batch_size: 640
119
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
120
+ - lr_scheduler_type: linear
121
+ - lr_scheduler_warmup_steps: 500
122
+ - num_epochs: 50
123
+ - mixed_precision_training: Native AMP
124
+
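Two of the second-stage values follow from the others, and a short sketch may help show how. The effective batch size is the per-device batch size times the gradient accumulation steps, and a linear scheduler with warmup ramps the learning rate up from zero and then decays it linearly back to zero. The function names below are hypothetical. The single-device setting and the total of 1550 optimizer steps are assumptions: the total is a rough estimate from the results table (step 1500 at epoch 48.38) and is not stated in the card.

```python
def effective_batch_size(per_device_batch, accumulation_steps, num_devices=1):
    """Samples contributing to one optimizer update (num_devices=1 is an assumption)."""
    return per_device_batch * accumulation_steps * num_devices


def linear_lr(step, base_lr, warmup_steps, total_steps):
    """Linear warmup from 0 to base_lr, then linear decay to 0 at total_steps."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Values from the second-stage list: batch 32, accumulation 20, lr 0.001, warmup 500.
    print("total_train_batch_size:", effective_batch_size(32, 20))
    total_steps = 1550  # assumed, roughly 50 epochs at about 31 steps per epoch
    for step in (0, 250, 500, 1000, 1550):
        print(step, linear_lr(step, 0.001, 500, total_steps))
```

Under these assumptions the peak learning rate of 0.001 is reached at step 500, about a third of the way through the second stage.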
125
  ### Training results
126
 
127
  | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
 
145
  | 0.0527 | 137.09 | 4250 | 0.6652 | 0.4749 | 0.1090 |
146
  | 0.0506 | 145.16 | 4500 | 0.6958 | 0.4846 | 0.1133 |
147
 
148
+ Further fine-tuning with a slightly different architecture and a higher learning rate:
149
+
150
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
151
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
152
+ | 0.576 | 8.06 | 250 | 0.2411 | 0.2340 | 0.0502 |
153
+ | 0.2564 | 16.13 | 500 | 0.2305 | 0.2097 | 0.0492 |
154
+ | 0.2018 | 24.19 | 750 | 0.2371 | 0.2059 | 0.0494 |
155
+ | 0.1549 | 32.25 | 1000 | 0.2298 | 0.1844 | 0.0435 |
156
+ | 0.1224 | 40.32 | 1250 | 0.2288 | 0.1725 | 0.0407 |
157
+ | 0.1004 | 48.38 | 1500 | 0.2327 | 0.1608 | 0.0376 |
158
+
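The second-stage table can be read programmatically to compare checkpoints. The sketch below parses the rows copied from the table above and picks the step with the lowest WER and the step with the lowest validation loss. The parsing helper is hypothetical and only handles simple pipe-delimited tables like this one.

```python
TABLE = """
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
| 0.576 | 8.06 | 250 | 0.2411 | 0.2340 | 0.0502 |
| 0.2564 | 16.13 | 500 | 0.2305 | 0.2097 | 0.0492 |
| 0.2018 | 24.19 | 750 | 0.2371 | 0.2059 | 0.0494 |
| 0.1549 | 32.25 | 1000 | 0.2298 | 0.1844 | 0.0435 |
| 0.1224 | 40.32 | 1250 | 0.2288 | 0.1725 | 0.0407 |
| 0.1004 | 48.38 | 1500 | 0.2327 | 0.1608 | 0.0376 |
"""


def parse_rows(table):
    """Turn a pipe-delimited Markdown table into a list of dicts with float values."""
    lines = [ln.strip() for ln in table.strip().splitlines()]
    header = [c.strip() for c in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[1:]:
        cells = [c.strip() for c in ln.strip("|").split("|")]
        if set(cells[0]) <= set(":-"):  # skip the alignment/separator row
            continue
        rows.append(dict(zip(header, map(float, cells))))
    return rows


rows = parse_rows(TABLE)
best_wer = min(rows, key=lambda r: r["Wer"])
best_loss = min(rows, key=lambda r: r["Validation Loss"])

if __name__ == "__main__":
    print("lowest WER at step", int(best_wer["Step"]), "->", best_wer["Wer"])
    print("lowest validation loss at step", int(best_loss["Step"]),
          "->", best_loss["Validation Loss"])
```

In this table the lowest WER and the lowest validation loss fall on different rows, so the choice of checkpoint depends on which metric is used for selection.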
159
 
160
  ### Framework versions
161