kingabzpro commited on
Commit
becf0de
1 Parent(s): f2f25c6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -15
README.md CHANGED
@@ -1,11 +1,57 @@
1
  ---
 
 
 
 
2
  tags:
3
- - generated_from_trainer
 
4
  datasets:
5
- - common_voice
 
 
 
6
  model-index:
7
- - name: wav2vec2-large-xls-r-300m-Urdu
8
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -15,23 +61,14 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [Harveenchadha/vakyansh-wav2vec2-urdu-urm-60](https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-urdu-urm-60) on the common_voice dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 7.6846
19
  - Wer: 0.5747
20
  - Cer: 0.3268
21
 
22
  ## Model description
23
-
24
- More information needed
25
-
26
- ## Intended uses & limitations
27
-
28
- More information needed
29
-
30
- ## Training and evaluation data
31
-
32
- More information needed
33
 
34
  ## Training procedure
 
35
 
36
  ### Training hyperparameters
37
 
 
1
  ---
2
+ language:
3
+ - ur
4
+
5
+ license: apache-2.0
6
  tags:
7
+ - automatic-speech-recognition
8
+ - robust-speech-event
9
  datasets:
10
+ - mozilla-foundation/common_voice_7_0
11
+ metrics:
12
+ - wer
13
+ - cer
14
  model-index:
15
+ - name: wav2vec2-60-urdu
16
+ results:
17
+ - task:
18
+ type: automatic-speech-recognition # Required. Example: automatic-speech-recognition
19
+ name: Speech Recognition # Optional. Example: Speech Recognition
20
+ dataset:
21
+ type: mozilla-foundation/common_voice_7_0 # Required. Example: common_voice. Use dataset id from https://hf.co/datasets
22
+ name: common-voice # Required. Example: Common Voice zh-CN
23
+ args: ur # Optional. Example: zh-CN
24
+ metrics:
25
+ - type: wer # Required. Example: wer
26
+ value: 57.4 # Required. Example: 20.90
27
+ name: Test WER # Optional. Example: Test WER
28
+ args:
29
+ - learning_rate: 0.0003
30
+ - train_batch_size: 64
31
+ - eval_batch_size: 8
32
+ - seed: 42
33
+ - gradient_accumulation_steps: 2
34
+ - total_train_batch_size: 128
35
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
36
+ - lr_scheduler_type: linear
37
+ - lr_scheduler_warmup_steps: 100
38
+ - num_epochs: 100
39
+ - mixed_precision_training: Native AMP # Optional. Example for BLEU: max_order
40
+ - type: cer # Required. Example: wer
41
+ value: 32.6 # Required. Example: 20.90
42
+ name: Test CER # Optional. Example: Test WER
43
+ args:
44
+ - learning_rate: 0.0003
45
+ - train_batch_size: 64
46
+ - eval_batch_size: 8
47
+ - seed: 42
48
+ - gradient_accumulation_steps: 2
49
+ - total_train_batch_size: 128
50
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
+ - lr_scheduler_type: linear
52
+ - lr_scheduler_warmup_steps: 100
53
+ - num_epochs: 100
54
+ - mixed_precision_training: Native AMP
55
  ---
56
 
57
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
61
 
62
  This model is a fine-tuned version of [Harveenchadha/vakyansh-wav2vec2-urdu-urm-60](https://huggingface.co/Harveenchadha/vakyansh-wav2vec2-urdu-urm-60) on the common_voice dataset.
63
  It achieves the following results on the evaluation set:
 
64
  - Wer: 0.5747
65
  - Cer: 0.3268
66
 
67
  ## Model description
68
+ The training and valid dataset is 0.58 hours. It was hard to train any model on lower number of so I decided to take vakyansh-wav2vec2-urdu-urm-60 checkpoint and finetune the wav2vec2 model.
 
 
 
 
 
 
 
 
 
69
 
70
  ## Training procedure
71
+ Trained on Harveenchadha/vakyansh-wav2vec2-urdu-urm-60 due to lesser number of samples.
72
 
73
  ### Training hyperparameters
74