kneth90 committed on
Commit c2cc006 · verified · 1 Parent(s): 2eff4d6

End of training

README.md CHANGED
@@ -3,27 +3,38 @@ library_name: transformers
  language:
  - id
  license: apache-2.0
- base_model: openai/whisper-small
  tags:
  - generated_from_trainer
  datasets:
- - kneth90/temp_snamol
  metrics:
  - wer
  model-index:
- - name: Whisper SMALL Snamol
-   results: []
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

- # Whisper SMALL Snamol

- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.3514
- - Wer: 50.0

  ## Model description
@@ -43,29 +54,30 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 1e-05
- - train_batch_size: 32
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
- - training_steps: 5000
  - mixed_precision_training: Native AMP

  ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Wer |
- |:-------------:|:------:|:----:|:---------------:|:-------:|
- | 0.0 | 1000.0 | 1000 | 0.9725 | 35.7143 |
- | 0.0 | 2000.0 | 2000 | 1.1105 | 42.8571 |
- | 0.0 | 3000.0 | 3000 | 1.2208 | 42.8571 |
- | 0.0 | 4000.0 | 4000 | 1.3330 | 50.0 |
- | 0.0 | 5000.0 | 5000 | 1.3514 | 50.0 |


  ### Framework versions

- - Transformers 4.47.1
- - Pytorch 2.5.1+cu121
  - Datasets 3.2.0
  - Tokenizers 0.21.0
 
  language:
  - id
  license: apache-2.0
+ base_model: kneth90/whisper-small-id-kp
  tags:
  - generated_from_trainer
  datasets:
+ - kneth90/snamol_temp
  metrics:
  - wer
  model-index:
+ - name: WHISPER Small KP
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Common Voice 11.0
+       type: kneth90/snamol_temp
+       args: 'config: hi, split: test'
+     metrics:
+     - name: Wer
+       type: wer
+       value: 814.2857142857142
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->

+ # WHISPER Small KP

+ This model is a fine-tuned version of [kneth90/whisper-small-id-kp](https://huggingface.co/kneth90/whisper-small-id-kp) on the Common Voice 11.0 dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.4420
+ - Wer: 814.2857

  ## Model description
 
  The following hyperparameters were used during training:
  - learning_rate: 1e-05
+ - train_batch_size: 16
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
+ - training_steps: 6000
  - mixed_precision_training: Native AMP

  ### Training results

+ | Training Loss | Epoch | Step | Validation Loss | Wer |
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
+ | 0.0 | 1000.0 | 1000 | 0.5975 | 7.1429 |
+ | 0.0 | 2000.0 | 2000 | 0.8193 | 14.2857 |
+ | 0.0 | 3000.0 | 3000 | 1.0415 | 50.0 |
+ | 0.0 | 4000.0 | 4000 | 1.3226 | 50.0 |
+ | 0.0 | 5000.0 | 5000 | 1.3835 | 42.8571 |
+ | 0.0 | 6000.0 | 6000 | 1.4420 | 814.2857 |


  ### Framework versions

+ - Transformers 4.48.1
+ - Pytorch 2.5.1+cu124
  - Datasets 3.2.0
  - Tokenizers 0.21.0
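
Note on the Wer column in the README diff above: word error rate is (substitutions + deletions + insertions) divided by the number of reference words, so it is not capped at 100%. A value such as 814.2857 at step 6000 can occur when the model emits far more words than the reference contains, for example in a repetition loop. The sketch below is a minimal pure-Python illustration of that arithmetic, not the `wer` metric implementation the Trainer used; the function name `word_error_rate` and the sample strings are made up for the example.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix seen so far
    # and the first j hypothesis words (classic Levenshtein dynamic programming).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# Hypothetical strings: three inserted words against a two-word reference.
print(word_error_rate("selamat pagi", "selamat pagi"))
print(word_error_rate("selamat pagi", "selamat pagi pagi pagi pagi"))
```

The second call reports 150.0, because three insertions are divided by only two reference words. The values in both results tables are multiples of 100/14 (7.1429 = 100/14, 814.2857 = 11400/14), which would be consistent with an evaluation set of roughly 14 reference words; that is an inference from the numbers, not something the commit states.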
generation_config.json CHANGED
@@ -250,5 +250,5 @@
      "transcribe": 50359,
      "translate": 50358
    },
-   "transformers_version": "4.47.1"
  }

      "transcribe": 50359,
      "translate": 50358
    },
+   "transformers_version": "4.48.1"
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6abc6e1a39534ae86f57ee992a283a1600a4f6100f3cccc190ce804f7f61225f
  size 966995080

  version https://git-lfs.github.com/spec/v1
+ oid sha256:c52b0a41d1a29247605de913d6617ffe1d2838c47b2d148efb0f8371d45797bf
  size 966995080
runs/Jan30_17-00-35_msi/events.out.tfevents.1738231239.msi.4947.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f8cbde0819313090d0100fff8bae5d130a0fe78153e549cd101c537534795afb
- size 50588

  version https://git-lfs.github.com/spec/v1
+ oid sha256:ab55d8f9b0270ae1a48c68f1682c5281400cc9748d47cddb72d259302695cd1e
+ size 59700
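
Note on the Git LFS pointers above: the diffs for `model.safetensors` and the TensorBoard events file show only pointer files (a `version` line, an `oid sha256:` digest, and a `size` in bytes), not the binary contents. A downloaded file can be checked against its pointer by comparing byte size and SHA-256 digest. The following is a minimal sketch using only the Python standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS or Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in 1 MiB chunks so a ~1 GB checkpoint is never loaded whole.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for the new model.safetensors, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:c52b0a41d1a29247605de913d6617ffe1d2838c47b2d148efb0f8371d45797bf\n"
    "size 966995080\n"
)
print(pointer["algo"], pointer["size"])
```

With a local copy of the checkpoint, `matches_pointer("model.safetensors", pointer)` would perform the actual comparison; the path is an assumption about where the file was downloaded. Note that the size is unchanged between the two commits (966995080 bytes) while the digest differs, which is what one would expect when weights are updated without changing the architecture.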