jayashreedevi2020/wav2vec2-large-xls-r-300m-assamese_speech_to_IPA

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-xls-r-300m
 datasets:
 - common_voice_11_0
 metrics:
@@ -11,8 +11,8 @@ model-index:
 - name: wav2vec2-large-xls-r-300m-assamese_speech_to_IPA
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_11_0
       type: common_voice_11_0
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: as
     metrics:
-    - type: wer
-      value: 0.5796
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7335
-- Wer: 0.5796
 ## Model description
@@ -61,15 +61,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer    |
 |:-------------:|:-------:|:----:|:---------------:|:------:|
-| 4.7433        | 9.8765  | 400  | 0.9355          | 0.7508 |
-| 0.2856        | 19.7531 | 800  | 0.7335          | 0.5796 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-xls-r-300m
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_11_0
 metrics:
 - name: wav2vec2-large-xls-r-300m-assamese_speech_to_IPA
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_11_0
       type: common_voice_11_0
       split: test
       args: as
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.5974643423137876
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0543
+- Wer: 0.5975
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 40
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss | Wer    |
 |:-------------:|:-------:|:----:|:---------------:|:------:|
+| 4.4763        | 9.8765  | 400  | 1.0898          | 0.8007 |
+| 0.3692        | 19.7531 | 800  | 0.9617          | 0.6628 |
+| 0.1187        | 29.6296 | 1200 | 1.0302          | 0.5990 |
+| 0.0659        | 39.5062 | 1600 | 1.0543          | 0.5975 |
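
The updated results table can be sanity-checked with a short script. The sketch below is not part of the training code. It copies the four rows from the 40-epoch table, reports which logged step has the lowest WER and which has the lowest validation loss, and derives the approximate number of optimizer steps per epoch from the `Step` and `Epoch` columns. The variable names are illustrative only.

```python
# Rows copied from the 40-epoch training results table:
# (training_loss, epoch, step, validation_loss, wer)
rows = [
    (4.4763, 9.8765, 400, 1.0898, 0.8007),
    (0.3692, 19.7531, 800, 0.9617, 0.6628),
    (0.1187, 29.6296, 1200, 1.0302, 0.5990),
    (0.0659, 39.5062, 1600, 1.0543, 0.5975),
]

# Logged evaluation point with the lowest word error rate.
best_by_wer = min(rows, key=lambda r: r[4])

# Logged evaluation point with the lowest validation loss.
best_by_loss = min(rows, key=lambda r: r[3])

# Steps per epoch implied by each row (step / epoch), rounded to 2 decimals.
steps_per_epoch = [round(r[2] / r[1], 2) for r in rows]

print("lowest WER at step", best_by_wer[2], "with WER", best_by_wer[4])
print("lowest validation loss at step", best_by_loss[2], "with loss", best_by_loss[3])
print("implied steps per epoch:", steps_per_epoch)
```

In these rows the lowest WER and the lowest validation loss fall on different steps, so which checkpoint counts as "best" depends on the metric used for selection.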
 ### Framework versions