tgrhn commited on
Commit
e6e1981
1 Parent(s): 8356599

End of training

Browse files
Files changed (1) hide show
  1. README.md +83 -0
README.md ADDED
@@ -0,0 +1,83 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - tr
4
+ license: apache-2.0
5
+ base_model: openai/whisper-large-v2
6
+ tags:
7
+ - whisper-event
8
+ - generated_from_trainer
9
+ datasets:
10
+ - mozilla-foundation/common_voice_16_1
11
+ metrics:
12
+ - wer
13
+ model-index:
14
+ - name: 'Whisper Small Tr '
15
+ results:
16
+ - task:
17
+ name: Automatic Speech Recognition
18
+ type: automatic-speech-recognition
19
+ dataset:
20
+ name: Common Voice 16.1
21
+ type: mozilla-foundation/common_voice_16_1
22
+ config: tr
23
+ split: None
24
+ args: tr
25
+ metrics:
26
+ - name: Wer
27
+ type: wer
28
+ value: 18.987030332852974
29
+ ---
30
+
31
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
32
+ should probably proofread and complete it, then remove this comment. -->
33
+
34
+ # Whisper Small Tr
35
+
36
+ This model is a fine-tuned version of [openai/whisper-large-v2](https://huggingface.co/openai/whisper-large-v2) on the Common Voice 16.1 dataset.
37
+ It achieves the following results on the evaluation set:
38
+ - Loss: 0.2550
39
+ - Wer: 18.9870
40
+
41
+ ## Model description
42
+
43
+ More information needed
44
+
45
+ ## Intended uses & limitations
46
+
47
+ More information needed
48
+
49
+ ## Training and evaluation data
50
+
51
+ More information needed
52
+
53
+ ## Training procedure
54
+
55
+ ### Training hyperparameters
56
+
57
+ The following hyperparameters were used during training:
58
+ - learning_rate: 1e-05
59
+ - train_batch_size: 64
60
+ - eval_batch_size: 32
61
+ - seed: 42
62
+ - gradient_accumulation_steps: 2
63
+ - total_train_batch_size: 128
64
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
65
+ - lr_scheduler_type: linear
66
+ - lr_scheduler_warmup_steps: 500
67
+ - training_steps: 1000
68
+ - mixed_precision_training: Native AMP
69
+
70
+ ### Training results
71
+
72
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
73
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|
74
+ | 0.954 | 1.46 | 500 | 0.2702 | 20.1768 |
75
+ | 0.143 | 2.92 | 1000 | 0.2550 | 18.9870 |
76
+
77
+
78
+ ### Framework versions
79
+
80
+ - Transformers 4.38.1
81
+ - Pytorch 2.2.0+cu121
82
+ - Datasets 2.17.0
83
+ - Tokenizers 0.15.2