nutella-toast commited on
Commit
2292421
1 Parent(s): c890c6f

End of training

Browse files
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Wer
24
  type: wer
25
- value: 0.940809968847352
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,12 +32,12 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [nutella-toast/wav2vec2-large-xls-r-ssw](https://huggingface.co/nutella-toast/wav2vec2-large-xls-r-ssw) on the ml-superb-subset dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 1.0000
36
- - Wer: 0.9408
37
 
38
  ## Model description
39
 
40
- Finetuned version of vanilla Wav2Vec2 for CS224S.
41
 
42
  ## Intended uses & limitations
43
 
@@ -61,17 +61,22 @@ The following hyperparameters were used during training:
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 500
64
- - num_epochs: 5
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:------:|:----:|:---------------:|:------:|
71
- | 1.3831 | 1.0471 | 100 | 1.3053 | 1.0 |
72
- | 1.2606 | 2.0942 | 200 | 1.1802 | 0.9720 |
73
- | 1.0789 | 3.1414 | 300 | 1.0889 | 1.0405 |
74
- | 0.9249 | 4.1885 | 400 | 1.0000 | 0.9408 |
 
 
 
 
 
75
 
76
 
77
  ### Framework versions
 
22
  metrics:
23
  - name: Wer
24
  type: wer
25
+ value: 0.7320872274143302
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [nutella-toast/wav2vec2-large-xls-r-ssw](https://huggingface.co/nutella-toast/wav2vec2-large-xls-r-ssw) on the ml-superb-subset dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.7327
36
+ - Wer: 0.7321
37
 
38
  ## Model description
39
 
40
+ More information needed
41
 
42
  ## Intended uses & limitations
43
 
 
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 500
64
+ - num_epochs: 10
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:------:|:----:|:---------------:|:------:|
71
+ | 0.5779 | 1.0471 | 100 | 0.7902 | 0.8785 |
72
+ | 0.5307 | 2.0942 | 200 | 0.8185 | 0.8660 |
73
+ | 0.4826 | 3.1414 | 300 | 0.8378 | 0.8692 |
74
+ | 0.4529 | 4.1885 | 400 | 0.8048 | 0.9097 |
75
+ | 0.5053 | 5.2356 | 500 | 0.9541 | 0.8910 |
76
+ | 0.4149 | 6.2827 | 600 | 0.7687 | 0.7913 |
77
+ | 0.3179 | 7.3298 | 700 | 0.7678 | 0.7850 |
78
+ | 0.2642 | 8.3770 | 800 | 0.7151 | 0.7321 |
79
+ | 0.2147 | 9.4241 | 900 | 0.7327 | 0.7321 |
80
 
81
 
82
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:006e35e6236ac98764a41f080da74acff71fdca35a779668662573eb51b5a945
3
  size 1261938680
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bda6d8dee179d14dfa7ced6c486a54cf1c08eddf37a130782b764e22794bc6ff
3
  size 1261938680
runs/May21_05-49-57_ca1ac2ddf6a5/events.out.tfevents.1716270668.ca1ac2ddf6a5.619.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:01598ec021806b37238cdcc9be78c4884ac8cfb4064355910b136a8421401512
3
- size 12790
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5b4877b2319df1bb7d4c29fc5825627a4c288ac837a2b35f5219f3b030b7b53
3
+ size 13355