kingabzpro committed on
Commit 0436a8c
1 Parent(s): bb1f2f8

update model card README.md

Files changed (1)
  1. README.md +34 -70
README.md CHANGED
@@ -1,58 +1,11 @@
  ---
- language:
- - ur
-
- license: apache-2.0
  tags:
- - automatic-speech-recognition
- - robust-speech-event
  datasets:
- - mozilla-foundation/common_voice_8_0
- metrics:
- - wer
- - cer
  model-index:
- - name: wav2vec2-urdu-V8-Abid
- results:
- - task:
- type: automatic-speech-recognition # Required. Example: automatic-speech-recognition
- name: Speech Recognition # Optional. Example: Speech Recognition
- dataset:
- type: mozilla-foundation/common_voice_8_0 # Required. Example: common_voice. Use dataset id from https://hf.co/datasets
- name: Common Voice ur # Required. Example: Common Voice zh-CN
- args: ur # Optional. Example: zh-CN
- metrics:
- - type: wer # Required. Example: wer
- value: 39.52 # Required. Example: 20.90
- name: Test WER # Optional. Example: Test WER
- args:
- - learning_rate: 0.00007
- - train_batch_size: 64
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 128
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 100
- - num_epochs: 100
- - mixed_precision_training: Native AMP # Optional. Example for BLEU: max_order
- - type: cer # Required. Example: wer
- value: 17.60 # Required. Example: 20.90
- name: Test CER # Optional. Example: Test WER
- args:
- - learning_rate: 0.00007
- - train_batch_size: 64
- - eval_batch_size: 8
- - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 128
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 100
- - num_epochs: 100
- - mixed_precision_training: Native AMP
-
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -60,45 +13,56 @@ should probably proofread and complete it, then remove this comment. -->
  # wav2vec2-60-Urdu-V8

- This model is a fine-tuned version of [kingabzpro/wav2vec2-urdu](https://huggingface.co/kingabzpro/wav2vec2-urdu) on the common_voice dataset.
  It achieves the following results on the evaluation set:
- - Loss: 4.9192
- - Wer: 0.4741
- - Cer: 0.2504

  ## Training procedure

  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 7e-05
- - train_batch_size: 64
  - eval_batch_size: 8
  - seed: 42
- - gradient_accumulation_steps: 4
- - total_train_batch_size: 256
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 100
- - num_epochs: 100
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
- | 2.8836 | 16.62 | 50 | 4.7827 | 0.5011 | 0.2625 |
- | 0.6992 | 33.31 | 100 | 3.5358 | 0.4882 | 0.2537 |
- | 0.6321 | 49.92 | 150 | 4.9054 | 0.4774 | 0.2519 |
- | 0.4669 | 66.62 | 200 | 5.9508 | 0.4719 | 0.2513 |
- | 0.3119 | 83.31 | 250 | 5.5791 | 0.4745 | 0.2508 |
- | 0.2788 | 99.92 | 300 | 4.9192 | 0.4741 | 0.2504 |

  ### Framework versions

- - Transformers 4.17.0.dev0
- - Pytorch 1.10.2+cu102
- - Datasets 1.18.2.dev0
  - Tokenizers 0.11.0
 
  ---
  tags:
+ - generated_from_trainer
  datasets:
+ - common_voice
  model-index:
+ - name: wav2vec2-60-Urdu-V8
+ results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You

  # wav2vec2-60-Urdu-V8

+ This model is a fine-tuned version of [Harveenchadha/vakyansh-wav2vec2-urdu-urm-60](https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-urdu-urm-60) on the common_voice dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 11.4832
+ - Wer: 0.5729
+ - Cer: 0.3170
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
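Since the card has no usage example yet, here is a minimal transcription sketch. It assumes the checkpoint is published under a repo id like `kingabzpro/wav2vec2-60-Urdu-V8` and exposes the standard `transformers` Wav2Vec2 CTC interface; the repo id and the audio file name are placeholders, not details taken from this card.

```python
import torch
import torchaudio
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

# Hypothetical repo id; substitute the actual checkpoint for this card.
model_id = "kingabzpro/wav2vec2-60-Urdu-V8"

processor = Wav2Vec2Processor.from_pretrained(model_id)
model = Wav2Vec2ForCTC.from_pretrained(model_id)

# Load a local Urdu speech clip (placeholder file) and resample to the
# 16 kHz rate that wav2vec2 models expect.
waveform, sample_rate = torchaudio.load("sample_ur.wav")
waveform = torchaudio.functional.resample(waveform, sample_rate, 16_000)

inputs = processor(waveform.squeeze().numpy(), sampling_rate=16_000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits

# Greedy CTC decoding: pick the most likely token per frame, then collapse repeats/blanks.
predicted_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(predicted_ids)[0])
```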
+ ## Training and evaluation data
+ More information needed
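The body only names "common_voice", but the metadata removed in this commit pointed at `mozilla-foundation/common_voice_8_0` with `args: ur`. A plausible way to fetch that data is sketched below; the exact splits used for training are an assumption, not documented in this card.

```python
from datasets import load_dataset

# Assumed dataset/config based on the removed metadata
# (mozilla-foundation/common_voice_8_0, args: ur). The dataset is gated,
# so an authenticated Hugging Face token is required.
common_voice_train = load_dataset(
    "mozilla-foundation/common_voice_8_0", "ur",
    split="train+validation", use_auth_token=True,
)
common_voice_test = load_dataset(
    "mozilla-foundation/common_voice_8_0", "ur",
    split="test", use_auth_token=True,
)

print(common_voice_train)
```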
  ## Training procedure

  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 7.5e-05
+ - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
+ - gradient_accumulation_steps: 2
+ - total_train_batch_size: 32
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 200
+ - num_epochs: 50
  - mixed_precision_training: Native AMP
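As a rough illustration, these values map onto `transformers.TrainingArguments` as sketched below. The original training script is not part of this card, so `output_dir` is a placeholder and the Adam betas/epsilon listed above are simply the optimizer defaults.

```python
from transformers import TrainingArguments

# Sketch of the hyperparameters listed above; output_dir is a placeholder.
# total_train_batch_size (32) is implied: train_batch_size (16) x gradient_accumulation_steps (2).
training_args = TrainingArguments(
    output_dir="./wav2vec2-60-Urdu-V8",
    learning_rate=7.5e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=2,
    warmup_steps=200,
    num_train_epochs=50,
    lr_scheduler_type="linear",
    fp16=True,  # "Native AMP" mixed-precision training
)
```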
  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|
+ | 19.671 | 8.33 | 100 | 7.7671 | 0.8795 | 0.4492 |
+ | 2.085 | 16.67 | 200 | 9.2759 | 0.6201 | 0.3320 |
+ | 0.6633 | 25.0 | 300 | 8.7025 | 0.5738 | 0.3104 |
+ | 0.388 | 33.33 | 400 | 10.2286 | 0.5852 | 0.3128 |
+ | 0.2822 | 41.67 | 500 | 11.1953 | 0.5738 | 0.3174 |
+ | 0.2293 | 50.0 | 600 | 11.4832 | 0.5729 | 0.3170 |
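The Wer and Cer columns are word and character error rates. They can be recomputed from references and predictions with, for example, the `jiwer` package; this is only an illustration, since the card does not say which tool produced the reported numbers, and the strings below are toy examples.

```python
from jiwer import wer, cer

# Toy example: error rates between reference transcripts and model output.
references = ["یہ ایک مثال ہے"]    # ground-truth transcript(s)
predictions = ["یہ ایک مثل ہے"]    # hypothesis produced by the model

print(f"WER: {wer(references, predictions):.4f}")
print(f"CER: {cer(references, predictions):.4f}")
```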
  ### Framework versions

+ - Transformers 4.16.2
+ - Pytorch 1.10.0+cu111
+ - Datasets 1.18.3
  - Tokenizers 0.11.0