jcrkn/wav2vec2-large-xls-r-300m-breton-colab

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 0.7804713804713804
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3812
-- Wer: 0.7805
 ## Model description
@@ -52,29 +52,30 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
-- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 500
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 4.1427        | 3.12  | 400  | 1.6053          | 0.9481 |
-| 1.1814        | 6.25  | 800  | 1.4622          | 0.8634 |
-| 0.5771        | 9.38  | 1200 | 1.3812          | 0.7805 |
 ### Framework versions
-- Transformers 4.31.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.14.4
 - Tokenizers 0.13.3

     metrics:
     - name: Wer
       type: wer
+      value: 0.794035594035594
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4386
+- Wer: 0.7940
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0009
+- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 300
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.8198        | 1.17  | 300  | 1.5223          | 0.8765 |
+| 0.8458        | 2.35  | 600  | 1.4725          | 0.8534 |
+| 0.6537        | 3.52  | 900  | 1.4510          | 0.8080 |
+| 0.4878        | 4.7   | 1200 | 1.4386          | 0.7940 |
 ### Framework versions
+- Transformers 4.32.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.14.4
 - Tokenizers 0.13.3
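
A quick consistency check on the hyperparameters in this diff: on both sides, the reported `total_train_batch_size` equals `train_batch_size` times `gradient_accumulation_steps`. The sketch below assumes a single training device, since the card does not state a device count, and `effective_batch_size` is a hypothetical helper, not part of the Trainer API.

```python
def effective_batch_size(per_device: int, grad_accum_steps: int, num_devices: int = 1) -> int:
    """Samples contributing to one optimizer step (num_devices=1 is an assumption)."""
    return per_device * grad_accum_steps * num_devices


# Values copied from the removed (-) and added (+) lines of the model card diff.
old_run = {"train_batch_size": 16, "gradient_accumulation_steps": 2,
           "total_train_batch_size": 32, "wer": 0.7805}
new_run = {"train_batch_size": 8, "gradient_accumulation_steps": 2,
           "total_train_batch_size": 16, "wer": 0.7940}

for run in (old_run, new_run):
    computed = effective_batch_size(run["train_batch_size"],
                                    run["gradient_accumulation_steps"])
    # The card's reported total should match the product of the two settings.
    assert computed == run["total_train_batch_size"]

# Absolute change in final WER between the two versions of the card.
wer_delta = new_run["wer"] - old_run["wer"]
print(f"WER change: {wer_delta:+.4f}")
```

A positive change means the new run ends with a higher (worse) word error rate than the old one, which matches the values reported on each side of the diff.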

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5a5d7a160d06473eafeea45781085b31ce628a7dd4f9ebfc1c6164baeb5d2f3d
 size 1262078125

 version https://git-lfs.github.com/spec/v1
+oid sha256:af8078de2c6f3bf6e14a2f9ea86fa4d53cddcfff2fdcd50f5849247f3defdd32
 size 1262078125
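
Only the `oid` differs between the two pointers; the size is unchanged. A minimal sketch of how a downloaded `pytorch_model.bin` could be checked against the new pointer is shown below. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is something the reader would need to supply.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the added (+) side of this diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:af8078de2c6f3bf6e14a2f9ea86fa4d53cddcfff2fdcd50f5849247f3defdd32
size 1262078125
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest named in the pointer."""
    file_path = Path(path)
    if file_path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with file_path.open("rb") as handle:
        # Read in chunks so a file of over 1 GB is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(NEW_POINTER)
```

With the weights downloaded locally, `matches_pointer("pytorch_model.bin", pointer)` would return `True` only if the file is byte-identical to the version this commit points at.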