mprzibilla/super_large_finetune_CM01

Automatic Speech Recognition

Generated from Trainer


mprzibilla committed on Oct 20, 2022

Commit 345c7a5 • 1 Parent(s): e8f94fb

update model card README.md

Files changed (1)

README.md +18 -19

@@ -1,5 +1,4 @@
 ---
-license: apache-2.0
 tags:
 - generated_from_trainer
 model-index:
@@ -12,10 +11,10 @@ should probably proofread and complete it, then remove this comment. -->
 # super_large_finetune_CM01
-This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5380
-- Wer: 1.0
 ## Model description
@@ -35,29 +34,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
-- train_batch_size: 20
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 16065
-- num_epochs: 100
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step   | Validation Loss | Wer |
-|:-------------:|:-----:|:------:|:---------------:|:---:|
-| 13.2507       | 10.0  | 32130  | 2.7423          | 1.0 |
-| 2.0325        | 20.0  | 64260  | 2.6040          | 1.0 |
-| 1.9596        | 30.0  | 96390  | 2.5728          | 1.0 |
-| 1.9302        | 40.0  | 128520 | 2.5720          | 1.0 |
-| 1.9144        | 50.0  | 160650 | 2.5551          | 1.0 |
-| 1.9043        | 60.0  | 192780 | 2.5536          | 1.0 |
-| 1.8969        | 70.0  | 224910 | 2.5371          | 1.0 |
-| 1.8927        | 80.0  | 257040 | 2.5431          | 1.0 |
-| 1.8904        | 90.0  | 289170 | 2.5383          | 1.0 |
-| 1.8892        | 100.0 | 321300 | 2.5380          | 1.0 |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 model-index:
 # super_large_finetune_CM01
+This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 7.2285
+- Wer: 0.7714
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
+- train_batch_size: 15
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 857
+- num_epochs: 50
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Wer    |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|
+| 1.0031        | 5.0   | 1715  | 1.9766          | 0.7857 |
+| 0.2107        | 10.0  | 3430  | 3.8748          | 0.8238 |
+| 0.1393        | 15.0  | 5145  | 4.7403          | 0.7952 |
+| 0.0931        | 20.0  | 6860  | 3.5077          | 0.6667 |
+| 0.0649        | 25.0  | 8575  | 7.7419          | 0.9333 |
+| 0.0592        | 30.0  | 10290 | 5.6440          | 0.7762 |
+| 0.0396        | 35.0  | 12005 | 6.9629          | 0.6810 |
+| 0.03          | 40.0  | 13720 | 7.8282          | 0.7524 |
+| 0.0191        | 45.0  | 15435 | 6.4626          | 0.7429 |
+| 0.0121        | 50.0  | 17150 | 7.2285          | 0.7714 |
 ### Framework versions
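
The relationship between `lr_scheduler_warmup_steps`, `num_epochs`, and the `Step` column on both sides of the diff can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the last `Step` value in each results table is the total number of optimizer steps, and it assumes `lr_scheduler_type: linear` means linear warmup followed by linear decay to zero. The function names are hypothetical helpers, not part of the model card or of any library.

```python
# Values copied from the old (-) and new (+) sides of the README diff.
# ASSUMPTION: "total_steps" is the last Step value in each results table.
OLD = {"warmup_steps": 16065, "num_epochs": 100, "total_steps": 321300}
NEW = {"warmup_steps": 857, "num_epochs": 50, "total_steps": 17150}
BASE_LR = 0.0001  # learning_rate, unchanged by the commit


def steps_per_epoch(cfg):
    """Optimizer steps per epoch implied by the table."""
    return cfg["total_steps"] / cfg["num_epochs"]


def warmup_fraction(cfg):
    """Share of all training steps spent in warmup."""
    return cfg["warmup_steps"] / cfg["total_steps"]


def linear_schedule_lr(step, base_lr, warmup_steps, total_steps):
    """Learning rate at `step`: linear warmup, then linear decay to zero.

    ASSUMPTION: this is the shape implied by `lr_scheduler_type: linear`.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for name, cfg in (("old", OLD), ("new", NEW)):
        print(
            f"{name}: {steps_per_epoch(cfg):.0f} steps/epoch, "
            f"warmup = {warmup_fraction(cfg):.2%} of training"
        )
    # Learning rate at the end of each logged epoch block in the new run.
    for step in (1715, 8575, 17150):
        lr = linear_schedule_lr(step, BASE_LR, NEW["warmup_steps"], NEW["total_steps"])
        print(f"new run, step {step}: lr = {lr:.2e}")
```

Under these assumptions, both runs spend roughly 5% of their steps in warmup, so the drop from 16065 to 857 warmup steps tracks the much shorter schedule rather than a change in warmup policy.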