kiranpantha committed on
Commit
64514e8
1 Parent(s): 13128d5

End of training

Files changed (2)
  1. README.md +47 -13
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,4 +1,5 @@
  ---
  language:
  - ne
  license: apache-2.0
@@ -7,17 +8,35 @@ tags:
  - generated_from_trainer
  datasets:
  - openslr/openslr
  model-index:
- - name: Whisper Medium Nepali - Kiran Pantha
-   results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # Whisper Medium Nepali - Kiran Pantha

  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the OpenSLR54 dataset.

  ## Model description
@@ -36,20 +55,35 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 3e-05
- - train_batch_size: 8
- - eval_batch_size: 4
  - seed: 42
- - gradient_accumulation_steps: 16
- - total_train_batch_size: 128
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 200
- - training_steps: 2000

  ### Framework versions

- - Transformers 4.44.0
- - Pytorch 2.3.1+cu121
- - Datasets 2.20.0
  - Tokenizers 0.19.1
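
The removed `total_train_batch_size: 128` is consistent with the removed per-device batch size and accumulation setting (8 × 16). The sketch below reproduces that arithmetic and compares it with the new settings in this commit. It assumes a single device, and it assumes the new run uses no gradient accumulation because the updated card lists neither `gradient_accumulation_steps` nor `total_train_batch_size`. The helper name `effective_batch_size` is made up for illustration.

```python
def effective_batch_size(per_device, grad_accum=1, num_devices=1):
    """Examples contributing to one optimizer step."""
    return per_device * grad_accum * num_devices


# Removed settings: train_batch_size 8, gradient_accumulation_steps 16.
old_total = effective_batch_size(8, grad_accum=16)

# New settings: train_batch_size 16 (assumed: no accumulation, one device).
new_total = effective_batch_size(16)

# Upper bound on examples processed over the whole run (training_steps x batch).
old_examples = old_total * 2000
new_examples = new_total * 1000

print(f"old: {old_total} per step, {old_examples} examples in total")
print(f"new: {new_total} per step, {new_examples} examples in total")
```

Under these assumptions the new run takes half as many optimizer steps, each on a batch one eighth the size, so the two learning rates are not directly comparable.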
 
  ---
+ library_name: transformers
  language:
  - ne
  license: apache-2.0
 
  - generated_from_trainer
  datasets:
  - openslr/openslr
+ metrics:
+ - wer
  model-index:
+ - name: Whisper Large Nepali - Kiran Pantha
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: OpenSLR54
+       type: openslr/openslr
+       config: default
+       split: test
+       args: 'config: ne, split: test'
+     metrics:
+     - name: Wer
+       type: wer
+       value: 48.043676069153776
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

+ # Whisper Large Nepali - Kiran Pantha

  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the OpenSLR54 dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.3013
+ - Wer: 48.0437

  ## Model description
 
  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 16
+ - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 100
+ - training_steps: 1000
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch  | Step | Validation Loss | Wer     |
+ |:-------------:|:------:|:----:|:---------------:|:-------:|
+ | 0.8572        | 0.4785 | 100  | 0.4851          | 76.0692 |
+ | 0.4           | 0.9569 | 200  | 0.3592          | 64.0582 |
+ | 0.2329        | 1.4354 | 300  | 0.3153          | 56.2329 |
+ | 0.2098        | 1.9139 | 400  | 0.2918          | 53.5032 |
+ | 0.1189        | 2.3923 | 500  | 0.2865          | 51.4104 |
+ | 0.096         | 2.8708 | 600  | 0.2835          | 50.7734 |
+ | 0.0565        | 3.3493 | 700  | 0.2984          | 50.9554 |
+ | 0.0425        | 3.8278 | 800  | 0.2947          | 48.7716 |
+ | 0.027         | 4.3062 | 900  | 0.3007          | 49.4995 |
+ | 0.0174        | 4.7847 | 1000 | 0.3013          | 48.0437 |
+

  ### Framework versions

+ - Transformers 4.44.2
+ - Pytorch 2.4.0+cu121
+ - Datasets 2.21.0
  - Tokenizers 0.19.1
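
The added "Training results" table can be read programmatically to see where each metric bottoms out. The sketch below copies the table rows by hand (the variable names are illustrative, not part of the commit) and reports the step with the lowest WER, the step with the lowest validation loss, and the approximate number of optimizer steps per epoch implied by the Epoch column.

```python
# Rows copied from the table above:
# (training loss, epoch, step, validation loss, WER in percent)
rows = [
    (0.8572, 0.4785, 100, 0.4851, 76.0692),
    (0.4, 0.9569, 200, 0.3592, 64.0582),
    (0.2329, 1.4354, 300, 0.3153, 56.2329),
    (0.2098, 1.9139, 400, 0.2918, 53.5032),
    (0.1189, 2.3923, 500, 0.2865, 51.4104),
    (0.096, 2.8708, 600, 0.2835, 50.7734),
    (0.0565, 3.3493, 700, 0.2984, 50.9554),
    (0.0425, 3.8278, 800, 0.2947, 48.7716),
    (0.027, 4.3062, 900, 0.3007, 49.4995),
    (0.0174, 4.7847, 1000, 0.3013, 48.0437),
]

# Checkpoint with the lowest word error rate.
best_wer = min(rows, key=lambda r: r[4])

# Checkpoint with the lowest validation loss.
best_loss = min(rows, key=lambda r: r[3])

# Steps per epoch, estimated from the final row (step / epoch).
steps_per_epoch = round(rows[-1][2] / rows[-1][1])

print(f"lowest WER: {best_wer[4]} at step {best_wer[2]}")
print(f"lowest validation loss: {best_loss[3]} at step {best_loss[2]}")
print(f"approx. steps per epoch: {steps_per_epoch}")
```

On these rows the validation loss is lowest at step 600, while WER keeps improving through step 1000, which is the checkpoint whose figures the card reports. Training loss falls much faster than validation loss after step 600, which may indicate overfitting, so the final checkpoint is not obviously the best one on every measure.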
generation_config.json CHANGED
@@ -252,5 +252,5 @@
      "transcribe": 50359,
      "translate": 50358
    },
-   "transformers_version": "4.44.0"
  }
 
      "transcribe": 50359,
      "translate": 50358
    },
+   "transformers_version": "4.44.2"
  }