wav2vec2-kaggle-final

This model is a fine-tuned version of facebook/wav2vec2-xls-r-1b on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 467.6487
  • Wer: 0.3843
  • Cer: 0.1615

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 16
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 15
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Wer Cer
1980.0484 0.17 200 1881.8782 1.0 1.0
1857.5231 0.34 400 1847.1379 1.0 1.0
1333.847 0.52 600 993.8895 0.9733 0.4174
990.9649 0.69 800 772.9453 0.8053 0.2881
822.3509 0.86 1000 699.9114 0.7153 0.2577
853.1906 1.03 1200 588.0330 0.6488 0.2377
681.4223 1.21 1400 595.6282 0.6082 0.2213
680.3315 1.38 1600 533.0568 0.5844 0.2104
612.303 1.55 1800 530.6596 0.5911 0.2142
628.1575 1.72 2000 504.1773 0.5508 0.2025
594.902 1.89 2200 483.2587 0.5422 0.1984
615.5168 2.07 2400 482.7257 0.5196 0.1939
552.6289 2.24 2600 487.7754 0.5168 0.1929
585.4439 2.41 2800 548.5886 0.5305 0.1981
531.7139 2.58 3000 593.6019 0.5795 0.2327
497.7652 2.76 3200 679.6472 0.5966 0.2382
479.2212 2.93 3400 566.4340 0.5237 0.1963
532.7164 3.1 3600 510.3879 0.4811 0.1846
508.853 3.27 3800 499.0350 0.4631 0.1798
502.5701 3.44 4000 557.4500 0.4885 0.1914
466.4071 3.62 4200 504.6231 0.4503 0.1766
521.5373 3.79 4400 816.4122 0.6920 0.2705
521.7708 3.96 4600 542.0289 0.4748 0.1827
434.5645 4.13 4800 483.0985 0.4563 0.1773
478.0779 4.3 5000 573.2537 0.4733 0.1826
490.7463 4.48 5200 448.2722 0.4495 0.1769
420.8497 4.65 5400 575.9213 0.4622 0.1794
444.1535 4.82 5600 536.1183 0.4689 0.1831
417.559 4.99 5800 526.9570 0.4678 0.1793
462.6115 5.17 6000 553.8561 0.5061 0.1966
472.7057 5.34 6200 473.8315 0.4402 0.1738
474.7029 5.51 6400 574.5921 0.5107 0.2021
422.9327 5.68 6600 533.9930 0.4616 0.1770
454.3927 5.85 6800 485.6865 0.4312 0.1717
414.617 6.03 7000 477.8979 0.4185 0.1709
431.8193 6.2 7200 577.0553 0.4742 0.1834
457.5081 6.37 7400 599.4647 0.4889 0.1939
410.7263 6.54 7600 495.1211 0.4386 0.1749
409.3561 6.72 7800 558.3831 0.4671 0.1846
419.5535 6.89 8000 521.3979 0.4745 0.1880
416.9934 7.06 8200 625.8699 0.5109 0.1989
402.1264 7.23 8400 624.1210 0.4906 0.1943
401.3531 7.4 8600 577.6796 0.4601 0.1828
393.8983 7.58 8800 539.1487 0.4512 0.1793
404.0746 7.75 9000 681.1195 0.5745 0.2427
367.9037 7.92 9200 600.4460 0.4890 0.1933
412.4216 8.09 9400 515.9341 0.4482 0.1764
412.5161 8.27 9600 496.5386 0.4204 0.1710
386.1062 8.44 9800 564.3361 0.4805 0.1899
373.9701 8.61 10000 534.6701 0.4569 0.1807
386.8188 8.78 10200 569.4644 0.4458 0.1769
353.7339 8.95 10400 586.9103 0.4924 0.1937
348.9872 9.13 10600 584.7791 0.4717 0.1890
370.6689 9.3 10800 593.6466 0.4604 0.1794
373.6055 9.47 11000 517.7433 0.4176 0.1664
397.119 9.64 11200 517.0852 0.4353 0.1735
348.3093 9.81 11400 542.9993 0.4342 0.1750
392.0189 9.99 11600 518.7840 0.4343 0.1732
331.1382 10.16 11800 520.4122 0.4155 0.1677
333.4658 10.33 12000 562.0241 0.4370 0.1745
386.7848 10.5 12200 523.9895 0.4181 0.1685
334.7904 10.68 12400 508.0610 0.4249 0.1708
371.6553 10.85 12600 569.8074 0.4587 0.1826
356.7725 11.02 12800 512.5233 0.4124 0.1672
336.8186 11.19 13000 526.1908 0.4311 0.1725
361.0486 11.36 13200 513.0122 0.4213 0.1711
317.7533 11.54 13400 475.2107 0.4128 0.1670
410.9716 11.71 13600 449.9915 0.4006 0.1650
355.4503 11.88 13800 455.7285 0.3945 0.1639
356.4254 12.05 14000 497.9339 0.4050 0.1665
344.9908 12.23 14200 483.6688 0.4072 0.1674
347.5455 12.4 14400 480.7558 0.3973 0.1646
348.6302 12.57 14600 491.5127 0.4001 0.1658
324.9071 12.74 14800 499.9131 0.4052 0.1669
344.9909 12.91 15000 487.6642 0.3919 0.1637
271.2929 13.09 15200 490.5817 0.3936 0.1637
340.6752 13.26 15400 488.0888 0.4021 0.1653
332.469 13.43 15600 483.0107 0.4040 0.1677
379.9843 13.6 15800 464.2564 0.3887 0.1624
324.5989 13.78 16000 469.0699 0.3923 0.1626
342.6454 13.95 16200 466.5754 0.3822 0.1605
299.187 14.12 16400 488.1742 0.3911 0.1627
352.0391 14.29 16600 481.3880 0.3872 0.1612
292.35 14.46 16800 471.6863 0.3864 0.1620
333.7441 14.64 17000 469.7429 0.3879 0.1618
318.1604 14.81 17200 468.5455 0.3846 0.1616
284.3848 14.98 17400 467.6487 0.3843 0.1615

Framework versions

  • Transformers 4.38.2
  • Pytorch 2.2.0+cu118
  • Datasets 2.16.0
  • Tokenizers 0.15.2
Downloads last month
1
Safetensors
Model size
1.0B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for anvitamanne/small-final-model-ishaan

Finetuned
(126)
this model