Commit fc1a7ae by thennal
1 parent: eda8b16

update model card README.md

Files changed (1):
  1. README.md +17 -33
@@ -4,45 +4,29 @@ language:
  license: apache-2.0
  tags:
  - whisper-event
+ - generated_from_trainer
  datasets:
  - mozilla-foundation/common_voice_11_0
- - google/fleurs
- - thennal/IMaSC
- - thennal/ulca_ml
- - thennal/msc
- - thennal/indic_tts_ml
- metrics:
- - wer
  model-index:
- - name: "Whisper Medium Malayalam - Thennal D K"
-   results:
-   - task:
-       name: Automatic Speech Recognition
-       type: automatic-speech-recognition
-     dataset:
-       name: Common Voice 11.0
-       type: mozilla-foundation/common_voice_11_0
-       config: ml
-       split: test
-       args: ml
-     metrics:
-     - name: Wer
-       type: wer
-       value: 42.98850574712644
-     - name: Cer
-       type: cer
-       value: 10.390585878818229
+ - name: Whisper Medium Malayalam
+   results: []
  ---
 
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
 
- # Whisper Medium Malayalam - Thennal D K
+ # Whisper Medium Malayalam
 
- This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on a combined dataset sourced from IMaSC,
- SMC, Indic TTS, FLEURS (train set), Common Voice 11 (train + other set), OpenSLR, and ULCA.
- It achieves the following results on the evaluation set (Common Voice 11 test split):
- - Loss: 0.0730
- - WER: 42.9886
- - CER: 10.3906
+ This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the Common Voice 11.0 dataset.
+ It achieves the following results on the evaluation set:
+ - eval_loss: 0.0833
+ - eval_wer: 43.6782
+ - eval_cer: 9.6895
+ - eval_runtime: 437.7464
+ - eval_samples_per_second: 0.256
+ - eval_steps_per_second: 0.016
+ - epoch: 3.59
+ - step: 7000
 
  ## Model description
 
@@ -68,7 +52,7 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
- - training_steps: 4000
+ - training_steps: 8000
  - mixed_precision_training: Native AMP
 
  ### Framework versions
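
Note on the metrics in this diff: the WER and CER figures in both versions of the card are word and character error rates, reported as percentages. As a rough illustration of what those numbers measure, the sketch below computes both from a Levenshtein edit distance in plain Python. The function names and the sample strings are hypothetical, and this is not the evaluation script behind the reported values (that script is not part of this commit). Real evaluations usually go through a metrics library and apply text normalization, which this sketch omits. Counting spaces as characters in CER is also an assumption made here.

```python
from typing import Sequence


def edit_distance(ref: Sequence, hyp: Sequence) -> int:
    """Levenshtein distance: minimum substitutions, insertions, deletions."""
    # prev[j] holds the distance between the ref prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion: ref item missing from hyp
                curr[j - 1] + 1,     # insertion: extra item in hyp
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1]


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent, splitting on whitespace."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate in percent (spaces count as characters here)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only.
    reference = "the cat sat on the mat"
    hypothesis = "the cat sit on mat"
    print(f"WER: {wer(reference, hypothesis):.2f}%")
    print(f"CER: {cer(reference, hypothesis):.2f}%")
```

Because the error count is divided by the reference length, WER is usually much higher than CER on the same output: a single wrong character turns a whole word into an error. That is consistent with the gap between the two values reported in the card.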