toshiouchiyama
/

whisper-small-ja

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

toshiouchiyama commited on Jan 13, 2023

Commit

9181caa

•

1 Parent(s): 4a919f9

update model card README.md

Files changed (1) hide show

README.md +12 -12

README.md CHANGED Viewed

@@ -1,7 +1,6 @@
 ---
 language:
 - ja
-license: apache-2.0
 tags:
 - hf-asr-leaderboard
 - generated_from_trainer
@@ -10,32 +9,31 @@ datasets:
 metrics:
 - wer
 model-index:
-- name: Whisper Small ja - Tohio Uchiyama
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: Special
       type: Specific-Person-Voice
       config: null
       split: None
-      args: 'config: ja, split: train'
     metrics:
     - name: Wer
       type: wer
-      value: 984.4311377245509
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small ja - Tohio Uchiyama
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Special dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9332
-- Wer: 984.4311
 ## Model description
@@ -61,14 +59,16 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
-- training_steps: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer      |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 2.0   | 10   | 1.1812          | 31.1377  |
-| No log        | 4.0   | 20   | 0.9332          | 984.4311 |
 ### Framework versions

 ---
 language:
 - ja
 tags:
 - hf-asr-leaderboard
 - generated_from_trainer
 metrics:
 - wer
 model-index:
+- name: Whisper Small Ja - Tohio Uchiyama
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: Specific-Person-Voice
       type: Specific-Person-Voice
       config: null
       split: None
     metrics:
     - name: Wer
       type: wer
+      value: 23.239436619718308
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Small Ja - Tohio Uchiyama
+This model is a fine-tuned version of [openai/whisper-small-ja](https://huggingface.co/openai/whisper-small-ja) on the Specific-Person-Voice dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7645
+- Wer: 23.2394
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 5
+- training_steps: 40
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer      |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.43  | 10   | 1.2124          | 21.1268  |
+| No log        | 2.86  | 20   | 0.8283          | 884.5070 |
+| 1.2688        | 4.29  | 30   | 0.7769          | 23.2394  |
+| 1.2688        | 5.71  | 40   | 0.7645          | 23.2394  |
 ### Framework versions