agnesluhtaru commited on
Commit
3e855c4
1 Parent(s): 25950fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -1
README.md CHANGED
@@ -2,6 +2,7 @@
2
  license: apache-2.0
3
  tags:
4
  - generated_from_trainer
 
5
  metrics:
6
  - wer
7
  model-index:
@@ -21,4 +22,40 @@ model-index:
21
  name: WER
22
  ---
23
 
24
- #TODO
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: apache-2.0
3
  tags:
4
  - generated_from_trainer
5
+ - whisper-event
6
  metrics:
7
  - wer
8
  model-index:
 
22
  name: WER
23
  ---
24
 
25
+ # whisper-small-et
26
+
27
+ This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the following datasets: Common Voice 11, VoxPopuli and FLEURS.
28
+
29
+ ## Model description
30
+
31
+ More information needed
32
+
33
+ ## Intended uses & limitations
34
+
35
+ More information needed
36
+
37
+ ## Training and evaluation data
38
+
39
+ Estonian data from Common Voice 11, VoxPopuli and FLEURS corpora as both training and validation sets. Tested on Common Voice 11 test set.
40
+
41
+ ## Training procedure
42
+
43
+ ### Training hyperparameters
44
+
45
+ The following hyperparameters were used during training:
46
+ - learning_rate: 1e-05
47
+ - train_batch_size: 32
48
+ - eval_batch_size: 16
49
+ - seed: 42
50
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
+ - lr_scheduler_type: linear
52
+ - lr_scheduler_warmup_steps: 500
53
+ - training_steps: 5000
54
+ - mixed_precision_training: Native AMP
55
+
56
+ ### Framework versions
57
+
58
+ - Transformers 4.26.0.dev0
59
+ - Pytorch 1.12.1+rocm5.1.1
60
+ - Datasets 2.7.1.dev0
61
+ - Tokenizers 0.13.2