mirodil commited on
Commit
ebda0e6
1 Parent(s): 1c77b8d

trained on CommonVoice 13

Browse files
README.md CHANGED
@@ -4,16 +4,29 @@ language:
4
  license: apache-2.0
5
  base_model: openai/whisper-tiny
6
  tags:
 
7
  - automatic-speech-recognition
8
  - generated_from_trainer
 
 
9
  metrics:
10
  - wer
11
  model-index:
12
  - name: Whisper Tiny Uzbek
13
- results: []
14
- datasets:
15
- - mozilla-foundation/common_voice_13_0
16
- pipeline_tag: automatic-speech-recognition
 
 
 
 
 
 
 
 
 
 
17
  ---
18
 
19
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -23,9 +36,9 @@ should probably proofread and complete it, then remove this comment. -->
23
 
24
  This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the mozilla-foundation/common_voice_13_0 dataset.
25
  It achieves the following results on the evaluation set:
26
- - Loss: 0.3775
27
- - Wer Ortho: 56.3536
28
- - Wer: 45.8937
29
 
30
  ## Model description
31
 
@@ -49,21 +62,20 @@ The following hyperparameters were used during training:
49
  - eval_batch_size: 16
50
  - seed: 42
51
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
52
- - lr_scheduler_type: constant_with_warmup
53
  - lr_scheduler_warmup_steps: 50
54
- - num_epochs: 1
55
 
56
  ### Training results
57
 
58
- | Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer |
59
- |:-------------:|:-----:|:----:|:---------------:|:---------:|:-------:|
60
- | 0.6542 | 0.13 | 500 | 0.6243 | 76.5585 | 67.7862 |
61
- | 0.5377 | 0.27 | 1000 | 0.5227 | 68.8556 | 60.2594 |
62
- | 0.4573 | 0.4 | 1500 | 0.4727 | 66.7551 | 56.0715 |
63
- | 0.4353 | 0.53 | 2000 | 0.4380 | 62.1211 | 52.5453 |
64
- | 0.3907 | 0.66 | 2500 | 0.4159 | 61.1252 | 50.8035 |
65
- | 0.4122 | 0.8 | 3000 | 0.3897 | 58.2628 | 47.8918 |
66
- | 0.3698 | 0.93 | 3500 | 0.3775 | 56.3536 | 45.8937 |
67
 
68
 
69
  ### Framework versions
@@ -71,4 +83,4 @@ The following hyperparameters were used during training:
71
  - Transformers 4.35.1
72
  - Pytorch 2.1.0
73
  - Datasets 2.14.6
74
- - Tokenizers 0.14.1
 
4
  license: apache-2.0
5
  base_model: openai/whisper-tiny
6
  tags:
7
+ - audio
8
  - automatic-speech-recognition
9
  - generated_from_trainer
10
+ datasets:
11
+ - audio
12
  metrics:
13
  - wer
14
  model-index:
15
  - name: Whisper Tiny Uzbek
16
+ results:
17
+ - task:
18
+ name: Automatic Speech Recognition
19
+ type: automatic-speech-recognition
20
+ dataset:
21
+ name: mozilla-foundation/common_voice_13_0
22
+ type: audio
23
+ config: uz
24
+ split: test
25
+ args: uz
26
+ metrics:
27
+ - name: Wer
28
+ type: wer
29
+ value: 36.79056163528213
30
  ---
31
 
32
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
36
 
37
  This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the mozilla-foundation/common_voice_13_0 dataset.
38
  It achieves the following results on the evaluation set:
39
+ - Loss: 0.2981
40
+ - Wer Ortho: 47.7812
41
+ - Wer: 36.7906
42
 
43
  ## Model description
44
 
 
62
  - eval_batch_size: 16
63
  - seed: 42
64
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
65
+ - lr_scheduler_type: cosine_with_restarts
66
  - lr_scheduler_warmup_steps: 50
67
+ - num_epochs: 5
68
 
69
  ### Training results
70
 
71
+ | Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer |
72
+ |:-------------:|:-----:|:-----:|:---------------:|:---------:|:-------:|
73
+ | 0.2929 | 0.8 | 3000 | 0.3281 | 50.8851 | 40.4395 |
74
+ | 0.2194 | 1.59 | 6000 | 0.3110 | 49.2325 | 37.9320 |
75
+ | 0.177 | 2.39 | 9000 | 0.3003 | 47.8700 | 36.8366 |
76
+ | 0.1574 | 3.18 | 12000 | 0.2997 | 48.2291 | 37.0491 |
77
+ | 0.1524 | 3.98 | 15000 | 0.2958 | 47.2395 | 36.4400 |
78
+ | 0.1455 | 4.77 | 18000 | 0.2981 | 47.7812 | 36.7906 |
 
79
 
80
 
81
  ### Framework versions
 
83
  - Transformers 4.35.1
84
  - Pytorch 2.1.0
85
  - Datasets 2.14.6
86
+ - Tokenizers 0.14.1
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3b2adc980edbd221cd30f2cb40caac62549c99e3eb736d422292c3c5a9328d0b
3
  size 151061672
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32770cd750470731caac857e7f1ef4b4733716940bda6164c8c33766d0c4b987
3
  size 151061672
runs/Nov26_00-28-37_L67DDV9G7R/events.out.tfevents.1700987323.L67DDV9G7R.63068.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0ed3d6d150e26d9b441182211bd8c3895ee9b53e8ccc8c4673f859c75ce0ce5
3
+ size 127271
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:445459a2541e1ef8a2633a1a0e8fb12d8697e4b16fdf2b2d5e219d2f10599ecd
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6475deeca5ff756eef35b1bf90f1a9620fcb5bb2663a5548a9310de3e805131
3
  size 4728