blackhole33 committed on
Commit
3abeb28
1 Parent(s): 6ec0d3a

Update README.md

  1. README.md +88 -3
README.md CHANGED
@@ -1,3 +1,88 @@
- ---
- license: apache-2.0
- ---
+ ---
+
+ language:
+ - uz
+ license: apache-2.0
+ base_model: openai/whisper-medium
+ tags:
+ - generated_from_trainer
+ datasets:
+ - mozilla-foundation/common_voice_17_0
+ metrics:
+ - wer
+ model-index:
+ - name: Whisper Medium UZB
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Common Voice 17.0
+       type: mozilla-foundation/common_voice_17_0
+       config: uz
+       split: None
+       args: 'config: uz, split: test'
+     metrics:
+     - name: Wer
+       type: wer
+       value: 31.77905998468049
+
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # Whisper Medium UZB
+
+ This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the Uzbek (`uz`) configuration of the Common Voice 17.0 dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.2859
+ - WER: 31.7791
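For readers unfamiliar with the metric, the sketch below shows how a word error rate like the one above is usually computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words, expressed here as a percentage. It is a minimal illustration under that standard definition, not the evaluation code behind this card. The function name `word_error_rate` and the example sentences are made up for illustration, and any text normalization applied before scoring is not stated in the card.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: word-level edit distance / reference length * 100."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            ))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# Illustrative sentences only (not taken from Common Voice).
print(word_error_rate("men bugun maktabga bordim",
                      "men bugun maktabda bordim"))  # 25.0 (1 substitution / 4 words)
```

On this scale, a WER of about 31.78 means roughly one word-level error for every three reference words.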
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ Founder: Rifat Mamayusupov
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 16
+ - eval_batch_size: 8
+ - seed: 42
+ - gradient_accumulation_steps: 2
+ - total_train_batch_size: 32
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 500
+ - training_steps: 4000
+ - mixed_precision_training: Native AMP
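Two of the values above follow from the others, and the sketch below works them out in plain Python. It assumes a single training device (the card does not say how many were used, but 16 × 2 = 32 is consistent with one) and the usual shape of a linear schedule with warmup: a ramp from 0 to the peak rate over the warmup steps, then a linear decay to 0 at the final step. The helper names `effective_batch_size` and `linear_schedule_lr` are hypothetical and are not part of any library.

```python
# Values copied from the hyperparameter list above.
LEARNING_RATE = 1e-05
WARMUP_STEPS = 500
TRAINING_STEPS = 4000
TRAIN_BATCH_SIZE = 16
GRAD_ACCUM_STEPS = 2


def effective_batch_size(per_step_batch: int, accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Samples contributing to one optimizer update (num_devices=1 is an assumption)."""
    return per_step_batch * accumulation_steps * num_devices


def linear_schedule_lr(step: int,
                       peak_lr: float = LEARNING_RATE,
                       warmup_steps: int = WARMUP_STEPS,
                       total_steps: int = TRAINING_STEPS) -> float:
    """Learning rate at a given optimizer step for linear warmup + linear decay."""
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to the peak rate.
        return peak_lr * step / warmup_steps
    # Decay: fall linearly from the peak rate to 0 at the last step.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / (total_steps - warmup_steps)


print(effective_batch_size(TRAIN_BATCH_SIZE, GRAD_ACCUM_STEPS))  # 32
for step in (0, 250, 500, 2250, 4000):
    print(step, linear_schedule_lr(step))
```

Under these assumptions the learning rate peaks at 1e-05 at step 500, is back to half of that by step 2250, and reaches 0 at step 4000.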
+
+ ### Training results
+
+ | Training Loss | Epoch  | Step | Validation Loss | WER (%) |
+ |:-------------:|:------:|:----:|:---------------:|:-------:|
+ | 0.5187        | 0.5392 | 1000 | 0.4935          | 44.1403 |
+ | 0.3423        | 1.0785 | 2000 | 0.4008          | 37.6948 |
+ | 0.3018        | 1.6177 | 3000 | 0.3739          | 36.3575 |
+ | 0.2401        | 2.1569 | 4000 | 0.2821          | 31.7791 |
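To put the table in perspective, the short sketch below computes the absolute and relative WER change between the first and last logged checkpoints. The numbers are copied from the table and treated as percentages. The variable and function names are illustrative only.

```python
# (step, WER in percent) pairs copied from the training results table.
checkpoints = [
    (1000, 44.1403),
    (2000, 37.6948),
    (3000, 36.3575),
    (4000, 31.7791),
]


def relative_reduction(start: float, end: float) -> float:
    """Relative improvement from start to end, in percent of the starting value."""
    return 100.0 * (start - end) / start


first_wer = checkpoints[0][1]
last_wer = checkpoints[-1][1]

absolute_drop = first_wer - last_wer
relative_drop = relative_reduction(first_wer, last_wer)

print(f"absolute drop: {absolute_drop:.4f} points")  # 12.3612 points
print(f"relative drop: {relative_drop:.1f}%")        # 28.0%
```

Between step 1000 and step 4000 the WER falls by about 12.4 points, a relative reduction of about 28%.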
+
+
+ ### Framework versions
+
+ - Transformers 4.41.2
+ - PyTorch 2.3.1+cu121
+ - Datasets 2.20.0
+ - Tokenizers 0.19.1