jcrkn
/

wav2vec2-large-xls-r-300m-breton-colab_batch

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

jcrkn commited on Aug 29, 2023

Commit

0767d52

•

1 Parent(s): a1bfb4f

End of training

Files changed (2) hide show

README.md +10 -10
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 0.6405002405002405
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8613
-- Wer: 0.6405
 ## Model description
@@ -53,24 +53,24 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 250
 - num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 5.921         | 0.84  | 250  | 2.9989          | 1.0    |
-| 1.7514        | 1.68  | 500  | 1.2574          | 0.8484 |
-| 0.9455        | 2.52  | 750  | 0.9508          | 0.7119 |
-| 0.674         | 3.36  | 1000 | 0.8613          | 0.6405 |
 ### Framework versions

     metrics:
     - name: Wer
       type: wer
+      value: 0.751034151034151
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0089
+- Wer: 0.7510
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
+- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 125
 - num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 7.2591        | 0.84  | 125  | 3.0754          | 1.0    |
+| 2.9952        | 1.68  | 250  | 2.8232          | 1.0    |
+| 1.9197        | 2.52  | 375  | 1.3326          | 0.8705 |
+| 1.0542        | 3.36  | 500  | 1.0089          | 0.7510 |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e9c2fbbd23af4b47a11b517f370cd35370464a8a5f8a076ff4a09239c1aff063
 size 1262078125

 version https://git-lfs.github.com/spec/v1
+oid sha256:05544c458e0008da3abd79c4c17a3e4c6736105a0bf46480fd2449c861754bb6
 size 1262078125