Akashpb13 commited on
Commit
43b53a7
1 Parent(s): 25d3f8f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -28
README.md CHANGED
@@ -4,13 +4,13 @@ language:
4
  license: apache-2.0
5
  tags:
6
  - automatic-speech-recognition
7
- - mozilla-foundation/common_voice_7_0
8
  - generated_from_trainer
9
  - hu
10
  - robust-speech-event
11
  - model_for_talk
12
  datasets:
13
- - mozilla-foundation/common_voice_7_0
14
 
15
  model-index:
16
  - name: Akashpb13/xlsr_hungarian_new
@@ -19,16 +19,16 @@ model-index:
19
  name: Automatic Speech Recognition
20
  type: automatic-speech-recognition
21
  dataset:
22
- name: Common Voice 7
23
- type: mozilla-foundation/common_voice_7_0
24
  args: hu
25
  metrics:
26
  - name: Test WER
27
  type: wer
28
- value: 0.02698525418772714
29
  - name: Test CER
30
  type: cer
31
- value: 0.005033063261641211
32
  - task:
33
  name: Automatic Speech Recognition
34
  type: automatic-speech-recognition
@@ -39,18 +39,18 @@ model-index:
39
  metrics:
40
  - name: Test WER
41
  type: wer
42
- value: 0.02698525418772714
43
  - name: Test CER
44
  type: cer
45
- value: 0.005033063261641211
46
  ---
47
 
48
  # Akashpb13/xlsr_hungarian_new
49
 
50
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - hu dataset.
51
- It achieves the following results on evaluation set (which is 10 percent of train data set merged with invalidated data, reported, other, dev and validated datasets):
52
- - Loss: 0.184265
53
- - Wer: 0.292771
54
  ## Model description
55
  "facebook/wav2vec2-xls-r-300m" was finetuned.
56
 
@@ -73,8 +73,6 @@ The following hyperparameters were used during training:
73
  - eval_batch_size: 16
74
  - seed: 13
75
  - gradient_accumulation_steps: 16
76
- - total_train_batch_size: 316
77
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
78
  - lr_scheduler_type: cosine_with_restarts
79
  - lr_scheduler_warmup_steps: 500
80
  - num_epochs: 100
@@ -83,19 +81,14 @@ The following hyperparameters were used during training:
83
 
84
  ### Training results
85
 
86
- Step | Training Loss | Validation Loss | Wer
87
- ------|---------------|-----------------|----------
88
- 500 | 4.825900 | 1.001413 | 0.810308
89
- 1000 | 0.561400 | 0.202275 | 0.361987
90
- 1500 | 0.298900 | 0.169643 | 0.326449
91
- 2000 | 0.236500 | 0.168602 | 0.316215
92
- 2500 | 0.199100 | 0.182484 | 0.308587
93
- 3000 | 0.179100 | 0.178076 | 0.303005
94
- 3500 | 0.161500 | 0.179107 | 0.299935
95
- 4000 | 0.151700 | 0.183371 | 0.295283
96
- 4500 | 0.143700 | 0.184443 | 0.295283
97
- 5000 | 0.138900 | 0.184265 | 0.292771
98
-
99
 
100
  ### Framework versions
101
  - Transformers 4.16.0.dev0
@@ -105,9 +98,9 @@ Step | Training Loss | Validation Loss | Wer
105
 
106
  #### Evaluation Commands
107
 
108
- 1. To evaluate on `mozilla-foundation/common_voice_7_0` with split `test`
109
 
110
  ```bash
111
- python eval.py --model_id Akashpb13/xlsr_hungarian_new --dataset mozilla-foundation/common_voice_7_0 --config hu --split test
112
  ```
113
 
 
4
  license: apache-2.0
5
  tags:
6
  - automatic-speech-recognition
7
+ - mozilla-foundation/common_voice_8_0
8
  - generated_from_trainer
9
  - hu
10
  - robust-speech-event
11
  - model_for_talk
12
  datasets:
13
+ - mozilla-foundation/common_voice_8_0
14
 
15
  model-index:
16
  - name: Akashpb13/xlsr_hungarian_new
 
19
  name: Automatic Speech Recognition
20
  type: automatic-speech-recognition
21
  dataset:
22
+ name: Common Voice 8
23
+ type: mozilla-foundation/common_voice_8_0
24
  args: hu
25
  metrics:
26
  - name: Test WER
27
  type: wer
28
+ value: 0.2851621517163838
29
  - name: Test CER
30
  type: cer
31
+ value: 0.06112982522287432
32
  - task:
33
  name: Automatic Speech Recognition
34
  type: automatic-speech-recognition
 
39
  metrics:
40
  - name: Test WER
41
  type: wer
42
+ value: 0.2851621517163838
43
  - name: Test CER
44
  type: cer
45
+ value: 0.06112982522287432
46
  ---
47
 
48
  # Akashpb13/xlsr_hungarian_new
49
 
50
  This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - hu dataset.
51
+ It achieves the following results on evaluation set (which is 10 percent of train data set merged with invalidated data, reported, other and dev datasets):
52
+ - Loss: 0.197464
53
+ - Wer: 0.330094
54
  ## Model description
55
  "facebook/wav2vec2-xls-r-300m" was finetuned.
56
 
 
73
  - eval_batch_size: 16
74
  - seed: 13
75
  - gradient_accumulation_steps: 16
 
 
76
  - lr_scheduler_type: cosine_with_restarts
77
  - lr_scheduler_warmup_steps: 500
78
  - num_epochs: 100
 
81
 
82
  ### Training results
83
 
84
+ | Step | Training Loss | Validation Loss | Wer |
85
+ |------|---------------|-----------------|----------|
86
+ | 500 | 4.785300 | 0.952295 | 0.796236 |
87
+ | 1000 | 0.535800 | 0.217474 | 0.381613 |
88
+ | 1500 | 0.258400 | 0.205524 | 0.345056 |
89
+ | 2000 | 0.202800 | 0.198680 | 0.336264 |
90
+ | 2500 | 0.182700 | 0.197464 | 0.330094 |
91
+
 
 
 
 
 
92
 
93
  ### Framework versions
94
  - Transformers 4.16.0.dev0
 
98
 
99
  #### Evaluation Commands
100
 
101
+ 1. To evaluate on `mozilla-foundation/common_voice_8_0` with split `test`
102
 
103
  ```bash
104
+ python eval.py --model_id Akashpb13/xlsr_hungarian_new --dataset mozilla-foundation/common_voice_8_0 --config hu --split test
105
  ```
106