DewiBrynJones/whisper-large-v3-ft-cy-en

@@ -6,12 +6,8 @@ tags:
 metrics:
 - wer
 model-index:
-- name: whisper-large-v3-ft-cy
   results: []
-language:
-- cy
-- en
-pipeline_tag: automatic-speech-recognition
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,12 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # whisper-large-v3-ft-cy-en
-This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on Welsh
-and English bilingual data originally from Mozilla's Common Voice dataset (see: [techiaith/commonvoice_16_1_en_cy](https://huggingface.co/datasets/techiaith/commonvoice_16_1_en_cy)).
 It achieves the following results on the evaluation set:
-- Loss: 0.1480
-- Wer: 25.1341
 ## Model description
@@ -45,7 +39,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 4
-- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 8
 - total_train_batch_size: 32
@@ -56,17 +50,17 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer     |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.2078        | 0.25  | 1000 | 0.2198          | 28.7556 |
-| 0.1623        | 0.5   | 2000 | 0.1800          | 31.3698 |
-| 0.1417        | 0.75  | 3000 | 0.1585          | 18.7051 |
-| 0.1188        | 1.01  | 4000 | 0.1480          | 25.1341 |
 ### Framework versions
-- Transformers 4.37.1
-- Pytorch 2.1.2+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.1

 metrics:
 - wer
 model-index:
+- name: whisper-large-v3-ft-cy-en
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # whisper-large-v3-ft-cy-en
+This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1505
+- Wer: 9.5594
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 4
+- eval_batch_size: 1
 - seed: 42
 - gradient_accumulation_steps: 8
 - total_train_batch_size: 32
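
The three batch-related values above are consistent with each other. A minimal sketch of the arithmetic, assuming training ran on a single device (the device count is not stated in the card and is an assumption here):

```python
# Values copied from the hyperparameter list in the model card.
train_batch_size = 4
gradient_accumulation_steps = 8

# Assumption: one device. The card does not state the device count;
# 1 is the value that reproduces the reported total of 32.
num_devices = 1

# Effective batch size per optimizer step.
total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices
print(total_train_batch_size)
```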
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer     |
+|:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.2097        | 0.2497 | 1000 | 0.2169          | 14.2221 |
+| 0.1621        | 0.4993 | 2000 | 0.1816          | 11.6845 |
+| 0.1406        | 0.7490 | 3000 | 0.1609          | 10.2445 |
+| 0.1242        | 0.9987 | 4000 | 0.1505          | 9.5594  |
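
For readers unfamiliar with the Wer column, the following is a rough sketch of how a word error rate on a 0 to 100 scale can be computed for one reference/hypothesis pair using word-level edit distance. It is an illustration only, not the evaluation code used for this model. The function name `word_error_rate` and the example sentences are hypothetical. Reported figures are normally aggregated over a whole evaluation set rather than a single pair.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate as a percentage for one sentence pair."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Classic dynamic-programming edit distance over words.
    # prev[j] holds the distance between the first i-1 reference words
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    # Errors divided by the number of reference words, as a percentage.
    return 100.0 * prev[-1] / len(ref)


# Hypothetical example: one substituted word out of four.
print(word_error_rate("mae hi'n braf heddiw", "mae hi braf heddiw"))
```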
 ### Framework versions
+- Transformers 4.41.2
+- Pytorch 2.3.1+cu121
+- Datasets 2.20.0
+- Tokenizers 0.19.1

generation_config.json CHANGED

@@ -55,7 +55,7 @@
     ],
     [
       2,
-      50359
     ]
   ],
   "is_multilingual": true,
@@ -261,5 +261,5 @@
     "transcribe": 50360,
     "translate": 50359
   },
-  "transformers_version": "4.37.1"
 }

     ],
     [
       2,
+      50360
     ]
   ],
   "is_multilingual": true,
     "transcribe": 50360,
     "translate": 50359
   },
+  "transformers_version": "4.41.2"
 }
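
The changed token id in the first hunk can be read against the task mapping shown in the second hunk. The sketch below assumes the changed pair is a `[position, token_id]` entry of the kind Whisper generation configs use for forced decoder tokens, and that the `transcribe`/`translate` mapping is the task-to-id table. The key names are not visible in the hunks, so both readings are assumptions. The helper `describe` is hypothetical.

```python
# Task mapping as shown in the diff context.
task_to_id = {"transcribe": 50360, "translate": 50359}
id_to_task = {token_id: task for task, token_id in task_to_id.items()}

# The pair before and after the commit (assumed to be [position, token_id]).
old_entry = [2, 50359]
new_entry = [2, 50360]


def describe(entry):
    """Return (decoder position, task name) for a [position, token_id] pair."""
    position, token_id = entry
    return position, id_to_task.get(token_id, "unknown")


print("before:", describe(old_entry))
print("after: ", describe(new_entry))
```

Under these assumptions, the commit switches the token at position 2 from the `translate` id to the `transcribe` id.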

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6686fe3c7690f628e5ab073f06e026348ba174fb5fdd60bb29da596b5b876cda
 size 4993448880

 version https://git-lfs.github.com/spec/v1
+oid sha256:eeac4c7c783ddd9a08930830ff734432479bc59efa7a96c546955f80e1cb192b
 size 4993448880

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d6beb4de5b782026f3ae2233a375c9c67631bb4cbf13783a2afdbf8b4441293
 size 1180663192

 version https://git-lfs.github.com/spec/v1
+oid sha256:7d682caf14f5143a8bf317078201ed4111ef74d97eeccf9d63043c320dae16d6
 size 1180663192
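
The safetensors entries above are Git LFS pointer files, not the weights themselves. Each is a few `key value` lines giving the spec version, the content hash, and the byte size. A small sketch of parsing two pointers and comparing them follows, using the second shard's values from this diff. The function name `parse_lfs_pointer` is hypothetical.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its version, hash and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents for model-00002-of-00002.safetensors, before and after.
old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7d6beb4de5b782026f3ae2233a375c9c67631bb4cbf13783a2afdbf8b4441293
size 1180663192
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7d682caf14f5143a8bf317078201ed4111ef74d97eeccf9d63043c320dae16d6
size 1180663192
""")

# Same byte size, different hash: the weights changed but the tensor layout
# (and therefore the file size) stayed the same.
print(old_pointer["size"] == new_pointer["size"])
print(old_pointer["digest"] != new_pointer["digest"])
```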