Neruzo/whisper-small-hi

@@ -1,12 +1,12 @@
 ---
 language:
-- hi
 license: apache-2.0
 base_model: openai/whisper-small
 tags:
 - generated_from_trainer
 datasets:
-- mozilla-foundation/common_voice_11_0
 metrics:
 - wer
 model-index:
@@ -16,15 +16,13 @@ model-index:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: Common Voice 11.0
-      type: mozilla-foundation/common_voice_11_0
-      config: hi
-      split: None
-      args: 'config: hi, split: test'
     metrics:
     - name: Wer
       type: wer
-      value: 56.36163548632862
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,10 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
 # Whisper Small Hi - Sanchit Gandhi
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5549
-- Wer: 56.3616
 ## Model description
@@ -55,21 +53,20 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
-- training_steps: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.2754        | 0.01  | 10   | 0.5915          | 58.5541 |
-| 0.4426        | 0.02  | 20   | 0.5549          | 56.3616 |
 ### Framework versions

 ---
 language:
+- vi
 license: apache-2.0
 base_model: openai/whisper-small
 tags:
 - generated_from_trainer
 datasets:
+- vivos
 metrics:
 - wer
 model-index:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: '##'
+      type: vivos
+      args: 'config: vi, split: test'
     metrics:
     - name: Wer
       type: wer
+      value: 97.2027972027972
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # Whisper Small Hi - Sanchit Gandhi
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the ## dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3333
+- Wer: 97.2028
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 16
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
+- training_steps: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 1.6611        | 0.01  | 10   | 1.3333          | 97.2028 |
 ### Framework versions

generation_config.json CHANGED

@@ -150,7 +150,7 @@
     "<|yo|>": 50325,
     "<|zh|>": 50260
   },
-  "language": "hindi",
   "max_initial_timestamp_index": 50,
   "max_length": 448,
   "no_timestamps_token_id": 50363,

     "<|yo|>": 50325,
     "<|zh|>": 50260
   },
+  "language": "vietnamese",
   "max_initial_timestamp_index": 50,
   "max_length": 448,
   "no_timestamps_token_id": 50363,
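
The model card in this diff reports `wer` as its only metric (56.3616 before the change, 97.2028 after). As a rough illustration of what that number measures, here is a minimal sketch of word error rate as word-level edit distance divided by the reference length. It assumes the card reports WER as a percentage, and the function name `word_error_rate` is hypothetical: this is not the evaluation code the Trainer actually ran, which may also normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage (assumed scale): word-level edit distance / reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein dynamic program).
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from the hypothesis
            insertion = curr[j - 1] + 1  # extra word in the hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return 100.0 * prev[-1] / len(ref_words)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("a b c d", "a x c d"))
```

On this scale, a value near 97 means the hypothesis differs from the reference in almost every word, which is plausible after only 10 training steps on a new language.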