p4b
/

whisper-large-v2-lv

@@ -1,42 +1,38 @@
 ---
-language:
-- lv
 license: apache-2.0
 tags:
-- whisper-event
-- hf-asr-leaderboard
 - generated_from_trainer
 datasets:
-- mozilla-foundation/common_voice_11_0
 metrics:
 - wer
 model-index:
-- name: Whisper Large-v2 Latvian
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: mozilla-foundation/common_voice_11_0 lv
-      type: mozilla-foundation/common_voice_11_0
       config: lv
       split: test
       args: lv
     metrics:
     - name: Wer
       type: wer
-      value: 27.47628083491461
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Large-v2 Latvian
-This model is a fine-tuned version of [openai/whisper-large-v2](https://huggingface.co/openai/whisper-large-v2) on the mozilla-foundation/common_voice_11_0 lv dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3179
-- Wer: 27.4763
 ## Model description
@@ -55,27 +51,26 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-07
-- train_batch_size: 64
 - eval_batch_size: 32
 - seed: 42
 - distributed_type: multi-GPU
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- lr_scheduler_warmup_steps: 200
-- training_steps: 1500
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.5148        | 3.01  | 200  | 0.4189          | 39.3454 |
-| 0.3041        | 6.03  | 400  | 0.3335          | 29.5731 |
-| 0.1961        | 9.04  | 600  | 0.3186          | 27.7799 |
-| 0.2579        | 13.01 | 800  | 0.3167          | 27.5712 |
-| 0.2034        | 16.03 | 1000 | 0.3179          | 27.4763 |
-| 0.1478        | 19.04 | 1200 | 0.3193          | 27.5237 |
-| 0.2169        | 23.01 | 1400 | 0.3198          | 27.5047 |
 ### Framework versions

 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
+- common_voice_11_0
 metrics:
 - wer
 model-index:
+- name: p4b/whisper-large-v2-lv
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: common_voice_11_0
+      type: common_voice_11_0
       config: lv
       split: test
       args: lv
     metrics:
     - name: Wer
       type: wer
+      value: 19.97153700189753
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# p4b/whisper-large-v2-lv
+This model is a fine-tuned version of [p4b/whisper-large-v2-lv](https://huggingface.co/p4b/whisper-large-v2-lv) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2593
+- Wer: 19.9715
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-07
+- train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - distributed_type: multi-GPU
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- lr_scheduler_warmup_steps: 100
+- training_steps: 900
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.7919        | 3.03  | 200  | 0.2793          | 22.5806 |
+| 0.4409        | 6.05  | 400  | 0.2651          | 20.6072 |
+| 0.4393        | 10.01 | 600  | 0.2600          | 20.0664 |
+| 0.4975        | 13.04 | 800  | 0.2593          | 19.9715 |
 ### Framework versions