DewiBrynJones committed · verified
Commit b04062d · Parent: 6621ea7

Model save

Files changed (2):
  1. README.md +14 -13
  2. generation_config.json +1 -1
README.md CHANGED

@@ -1,4 +1,5 @@
 ---
+library_name: transformers
 license: apache-2.0
 base_model: openai/whisper-large-v3
 tags:
@@ -15,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # whisper-large-v3-ft-cv-cy-en
 
-This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the DewiBrynJones/commonvoice_18_0_cy_en train main dataset.
+This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3036
-- Wer: 0.1826
+- Loss: 0.2744
+- Wer: 0.1474
 
 ## Model description
 
@@ -43,7 +44,7 @@ The following hyperparameters were used during training:
 - seed: 42
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 32
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
 - training_steps: 5000
@@ -53,16 +54,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch  | Step | Validation Loss | Wer    |
 |:-------------:|:------:|:----:|:---------------:|:------:|
-| 0.294         | 0.7075 | 1000 | 0.3054          | 0.2114 |
-| 0.1572        | 1.4149 | 2000 | 0.2768          | 0.1898 |
-| 0.0714        | 2.1224 | 3000 | 0.2789          | 0.1807 |
-| 0.0772        | 2.8299 | 4000 | 0.2759          | 0.1810 |
-| 0.0337        | 3.5373 | 5000 | 0.3036          | 0.1826 |
+| 0.4825        | 0.7075 | 1000 | 0.2708          | 0.1810 |
+| 0.2262        | 1.4149 | 2000 | 0.2486          | 0.1594 |
+| 0.0867        | 2.1224 | 3000 | 0.2506          | 0.1511 |
+| 0.0973        | 2.8299 | 4000 | 0.2444          | 0.1490 |
+| 0.0303        | 3.5373 | 5000 | 0.2744          | 0.1474 |
 
 
 ### Framework versions
 
-- Transformers 4.44.0
-- Pytorch 2.4.0+cu121
-- Datasets 2.20.0
-- Tokenizers 0.19.1
+- Transformers 4.46.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.1.0
+- Tokenizers 0.20.1
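
For context, a minimal usage sketch (not part of this commit) of loading the checkpoint described by the updated card for inference. The Hub repo id `DewiBrynJones/whisper-large-v3-ft-cv-cy-en` is inferred from the model name and committer, and `sample.wav` is a placeholder file:

```python
# Minimal sketch, not part of this commit: run speech recognition with the
# fine-tuned checkpoint via the transformers pipeline API.
# Assumptions: the Hub repo id below (inferred from the card's model name)
# and the placeholder audio file "sample.wav".
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="DewiBrynJones/whisper-large-v3-ft-cv-cy-en",
)

print(asr("sample.wav")["text"])
```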
generation_config.json CHANGED

@@ -253,5 +253,5 @@
     "transcribe": 50360,
     "translate": 50359
   },
-  "transformers_version": "4.44.0"
+  "transformers_version": "4.46.1"
 }
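
As a quick check that the version bump landed, a hedged sketch (same assumed repo id as above) that reads the field back from the Hub with `GenerationConfig.from_pretrained`:

```python
# Minimal sketch, assuming the repo id used above: fetch the generation
# config from the Hub and inspect the transformers_version field that this
# commit bumps from 4.44.0 to 4.46.1.
from transformers import GenerationConfig

gen_cfg = GenerationConfig.from_pretrained("DewiBrynJones/whisper-large-v3-ft-cv-cy-en")
print(gen_cfg.transformers_version)  # expected: "4.46.1"
```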