humeur
/

whisper-small-sv-en

Automatic Speech Recognition

hf-asr-leaderboard

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

humeur commited on Dec 12, 2022

Commit

98fd040

•

1 Parent(s): 0db7791

Update README.md

Files changed (1) hide show

README.md +32 -5

README.md CHANGED Viewed

@@ -1,23 +1,40 @@
 ---
 language:
-- sv
 license: apache-2.0
 tags:
 - hf-asr-leaderboard
 - generated_from_trainer
 datasets:
-- mozilla-foundati/common_voice_11_0
 model-index:
-- name: Whisper Small sv-SE to en
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small sv-SE to en
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 ## Model description
@@ -46,6 +63,16 @@ The following hyperparameters were used during training:
 - training_steps: 4000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.26.0.dev0

 ---
 language:
+- se
 license: apache-2.0
 tags:
 - hf-asr-leaderboard
 - generated_from_trainer
 datasets:
+- mozilla-foundation/common_voice_11_0
+metrics:
+- wer
 model-index:
+- name: Whisper Small sv-SE - KTH
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice 11.0
+      type: mozilla-foundation/common_voice_11_0
+      config: sv
+      split: test
+    metrics:
+    - name: Wer
+      type: wer
+      value: 19.11929903392496
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Small sv-SE - KTH
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3310
+- Wer: 19.1193
 ## Model description
 - training_steps: 4000
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.1015        | 1.29  | 1000 | 0.2880          | 20.4134 |
+| 0.0387        | 2.59  | 2000 | 0.2959          | 19.6810 |
+| 0.0126        | 3.88  | 3000 | 0.3103          | 19.2990 |
+| 0.0035        | 5.17  | 4000 | 0.3310          | 19.1193 |
 ### Framework versions
 - Transformers 4.26.0.dev0