TSukiLen/whisper-small-chinese-tw-minnan

@@ -1,40 +1,42 @@
 ---
 library_name: transformers
 license: apache-2.0
 base_model: openai/whisper-small
 tags:
 - generated_from_trainer
 datasets:
-- common_voice_11_0
 metrics:
 - wer
 model-index:
-- name: whisper-small-chinese-tw-minnan
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: common_voice_11_0
-      type: common_voice_11_0
       config: nan-tw
       split: test
-      args: nan-tw
     metrics:
     - name: Wer
       type: wer
-      value: 95.14713474445018
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# whisper-small-chinese-tw-minnan
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9675
-- Wer: 95.1471
 ## Model description
@@ -54,28 +56,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 4000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer     |
 |:-------------:|:-------:|:----:|:---------------:|:-------:|
-| 0.032         | 7.2464  | 1000 | 0.8407          | 96.5927 |
-| 0.0013        | 14.4928 | 2000 | 0.9247          | 95.3020 |
-| 0.0005        | 21.7391 | 3000 | 0.9552          | 94.9923 |
-| 0.0004        | 28.9855 | 4000 | 0.9675          | 95.1471 |
 ### Framework versions
-- Transformers 4.46.2
 - Pytorch 2.4.0+cu124
 - Datasets 3.1.0
 - Tokenizers 0.20.3

 ---
 library_name: transformers
+language:
+- zh
 license: apache-2.0
 base_model: openai/whisper-small
 tags:
 - generated_from_trainer
 datasets:
+- mozilla-foundation/common_voice_11_0
 metrics:
 - wer
 model-index:
+- name: Whisper Small chinese Test
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: Common Voice 11.0
+      type: mozilla-foundation/common_voice_11_0
       config: nan-tw
       split: test
+      args: 'config: zh-tw, split: test'
     metrics:
     - name: Wer
       type: wer
+      value: 94.0629839958699
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper Small chinese Test
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9213
+- Wer: 94.0630
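
A WER above 90% alongside a near-zero training loss deserves a second look. One possible contributor, offered here as an assumption that the card does not confirm, is that word-level scoring splits on whitespace, which Han-character text usually lacks, so a single wrong character can mark a whole sentence as one wrong "word". The minimal sketch below contrasts word-level and character-level error rates on a hypothetical sentence pair; the helper names and the example strings are illustrative and are not taken from the evaluation set or from any library.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(ref_tokens, hyp_tokens):
    """Edit distance as a percentage of the reference length."""
    return 100.0 * edit_distance(ref_tokens, hyp_tokens) / len(ref_tokens)


def wer(reference, hypothesis):
    """Word error rate: tokens are whitespace-separated words."""
    return error_rate(reference.split(), hypothesis.split())


def cer(reference, hypothesis):
    """Character error rate: tokens are single characters, spaces ignored."""
    return error_rate(list(reference.replace(" ", "")),
                      list(hypothesis.replace(" ", "")))


# Hypothetical pair that differs in exactly one character.
reference = "今仔日天氣真好"
hypothesis = "今仔日天氣真歹"

print(f"WER: {wer(reference, hypothesis):.2f}")
print(f"CER: {cer(reference, hypothesis):.2f}")
```

If the evaluation text does contain spaces between words, this explanation does not apply and the reported WER should be read at face value.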
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 5000
 - mixed_precision_training: Native AMP
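
The scheduler settings above (linear schedule, 500 warmup steps, 5000 training steps, peak rate 1e-05) can be sketched as a plain function. This is a minimal illustration that assumes the usual linear-warmup, linear-decay shape; `linear_lr` is a hypothetical helper and not the Trainer's own implementation.

```python
def linear_lr(step, base_lr=1e-5, warmup_steps=500, total_steps=5000):
    """Learning rate at a given optimizer step under linear warmup and decay."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr over the warmup steps.
        return base_lr * step / warmup_steps
    # Decay linearly from base_lr down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / (total_steps - warmup_steps)


for step in (0, 250, 500, 2750, 5000):
    print(step, linear_lr(step))
```

Under these assumptions the rate peaks at step 500 and has fallen to half of its peak by step 2750.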
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer     |
 |:-------------:|:-------:|:----:|:---------------:|:-------:|
+| 0.1069        | 3.6364  | 1000 | 0.7541          | 99.3289 |
+| 0.0117        | 7.2727  | 2000 | 0.8330          | 93.9597 |
+| 0.0015        | 10.9091 | 3000 | 0.8627          | 94.7858 |
+| 0.0004        | 14.5455 | 4000 | 0.9036          | 93.3918 |
+| 0.0002        | 18.1818 | 5000 | 0.9213          | 94.0630 |
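
The Epoch and Step columns, combined with `train_batch_size`, allow a rough consistency check on the training set size across the two runs. The sketch below assumes a single device and no gradient accumulation, neither of which the card states, and treats the result as an upper bound because the last batch of an epoch may be partial. The helper names are hypothetical.

```python
def steps_per_epoch(step, epoch):
    """Optimizer steps in one epoch, inferred from a (step, epoch) table row."""
    return round(step / epoch)


def max_train_examples(step, epoch, batch_size):
    """Upper bound on training examples, assuming one device and no accumulation."""
    return steps_per_epoch(step, epoch) * batch_size


# Final row of each run's table, with that run's train_batch_size.
old_run = max_train_examples(step=4000, epoch=28.9855, batch_size=16)
new_run = max_train_examples(step=5000, epoch=18.1818, batch_size=8)

print("old run:", steps_per_epoch(4000, 28.9855), "steps/epoch, at most", old_run, "examples")
print("new run:", steps_per_epoch(5000, 18.1818), "steps/epoch, at most", new_run, "examples")
```

Both bounds land near 2,200 examples, which is consistent with the two runs using a training split of the same size. Under the same assumptions, halving the batch size roughly doubles the steps per epoch, so the new run covers fewer epochs despite taking more steps.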
 ### Framework versions
+- Transformers 4.46.3
 - Pytorch 2.4.0+cu124
 - Datasets 3.1.0
 - Tokenizers 0.20.3

generation_config.json CHANGED

@@ -250,5 +250,5 @@
     "transcribe": 50359,
     "translate": 50358
   },
-  "transformers_version": "4.46.2"
 }

     "transcribe": 50359,
     "translate": 50358
   },
+  "transformers_version": "4.46.3"
 }

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10bfeb8df0515ea970c549d4c77f6811ddb673fc7ed55d478bda013f357cad68
 size 966995080

 version https://git-lfs.github.com/spec/v1
+oid sha256:e142936c541af623b5593c82ccf3e92068bdc346712a4820d20abbe11a2b1afd
 size 966995080
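
The `model.safetensors` entry is a Git LFS pointer, not the weights themselves: only the `oid` changes between the two versions while `size` stays at 966995080 bytes. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration and handles only the three keys shown in this diff.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer contents copied from the new side of the diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:e142936c541af623b5593c82ccf3e92068bdc346712a4820d20abbe11a2b1afd
size 966995080
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["digest"][:12], f"{info['size_bytes'] / 1e6:.1f} MB")
```

An unchanged size with a different digest is what one would expect when the weights were retrained but the architecture and dtype stayed the same, although the pointer alone cannot confirm that.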