AshtonLKY
/

Whisper_ASR_ATC_v2

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

AshtonLKY commited on Jan 21

Commit

a1e2a60

•

1 Parent(s): 8e9e241

End of training

Files changed (1) hide show

README.md +33 -9

README.md CHANGED Viewed

@@ -8,9 +8,22 @@ tags:
 - generated_from_trainer
 datasets:
 - AshtonLKY/Whisper_ASR_ATC
 model-index:
 - name: Whisper_ASR_ATC
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -20,13 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the AshtonLKY/augmented_audio dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 0.1061
-- eval_wer: 10.8325
-- eval_runtime: 7410.9891
-- eval_samples_per_second: 1.813
-- eval_steps_per_second: 0.227
-- epoch: 0.89
-- step: 3000
 ## Model description
@@ -52,9 +60,25 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 6000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.36.2

 - generated_from_trainer
 datasets:
 - AshtonLKY/Whisper_ASR_ATC
+metrics:
+- wer
 model-index:
 - name: Whisper_ASR_ATC
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: AshtonLKY/augmented_audio
+      type: AshtonLKY/Whisper_ASR_ATC
+      args: 'split: test'
+    metrics:
+    - name: Wer
+      type: wer
+      value: 10.259091588129461
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the AshtonLKY/augmented_audio dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0171
+- Wer: 10.2591
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- training_steps: 10000
 - mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Wer     |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|
+| 0.2282        | 0.3   | 1000  | 0.2253          | 49.5224 |
+| 0.1461        | 0.6   | 2000  | 0.1456          | 42.3271 |
+| 0.1052        | 0.89  | 3000  | 0.1061          | 10.8325 |
+| 0.0698        | 1.19  | 4000  | 0.0708          | 13.8258 |
+| 0.043         | 1.49  | 5000  | 0.0537          | 11.0072 |
+| 0.0407        | 1.79  | 6000  | 0.0383          | 10.9401 |
+| 0.019         | 2.08  | 7000  | 0.0349          | 15.2078 |
+| 0.0323        | 2.38  | 8000  | 0.0268          | 11.4068 |
+| 0.0164        | 2.68  | 9000  | 0.0236          | 12.3902 |
+| 0.0153        | 2.98  | 10000 | 0.0171          | 10.2591 |
 ### Framework versions
 - Transformers 4.36.2