ED_small_cv_v2
This model is a fine-tuned version of on the common_voice_13_0 dataset. It achieves the following results on the evaluation set:
- Loss: 1.0688
- Cer: 0.0677
- Wer: 0.1598
- Mer: 0.1565
- Wil: 0.2593
- Wip: 0.7407
- Hits: 127573
- Substitutions: 17637
- Deletions: 2971
- Insertions: 3069
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.001
- train_batch_size: 128
- eval_batch_size: 64
- seed: 42
- distributed_type: multi-GPU
- num_devices: 4
- total_train_batch_size: 512
- total_eval_batch_size: 256
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 15000
- num_epochs: 70.0
Training results
Training Loss | Epoch | Step | Cer | Deletions | Hits | Insertions | Validation Loss | Mer | Substitutions | Wer | Wil | Wip |
---|---|---|---|---|---|---|---|---|---|---|---|---|
1.7038 | 5.0 | 14885 | 0.1731 | 6463 | 103703 | 7394 | 1.5150 | 0.3334 | 38015 | 0.3501 | 0.5133 | 0.4867 |
1.6051 | 6.0 | 17862 | 0.1512 | 6309 | 107805 | 6257 | 1.4257 | 0.3020 | 34067 | 0.3147 | 0.4705 | 0.5295 |
1.5396 | 7.0 | 20839 | 0.1368 | 5262 | 110900 | 6029 | 1.3715 | 0.2809 | 32019 | 0.2923 | 0.4428 | 0.5572 |
1.4436 | 8.0 | 23816 | 0.1299 | 5464 | 112866 | 5686 | 1.3285 | 0.2665 | 29851 | 0.2767 | 0.4207 | 0.5793 |
1.4287 | 9.0 | 26793 | 0.1241 | 4793 | 114249 | 5680 | 1.3090 | 0.2575 | 29139 | 0.2673 | 0.4091 | 0.5909 |
1.395 | 10.0 | 29770 | 0.1200 | 4698 | 115580 | 5505 | 1.2842 | 0.2479 | 27903 | 0.2572 | 0.3949 | 0.6051 |
1.3668 | 11.0 | 32747 | 0.1111 | 4400 | 117281 | 4991 | 1.2510 | 0.2343 | 26500 | 0.2422 | 0.3761 | 0.6239 |
1.3238 | 12.0 | 35724 | 0.1064 | 4560 | 117845 | 4500 | 1.2363 | 0.2282 | 25776 | 0.2351 | 0.3673 | 0.6327 |
1.3133 | 13.0 | 38701 | 0.1028 | 4173 | 118971 | 4613 | 1.2215 | 0.2214 | 25037 | 0.2283 | 0.3573 | 0.6427 |
1.2968 | 14.0 | 41678 | 0.0995 | 3937 | 119798 | 4466 | 1.2026 | 0.2152 | 24446 | 0.2217 | 0.3487 | 0.6513 |
1.2783 | 15.0 | 44655 | 0.0974 | 4071 | 120231 | 4295 | 1.1939 | 0.2115 | 23879 | 0.2176 | 0.3427 | 0.6573 |
1.2359 | 16.0 | 47632 | 0.0961 | 3946 | 120640 | 4313 | 1.1884 | 0.2089 | 23595 | 0.2150 | 0.3388 | 0.6612 |
1.2543 | 17.0 | 50609 | 0.0939 | 3757 | 121623 | 4476 | 1.1743 | 0.2033 | 22801 | 0.2094 | 0.3296 | 0.6704 |
1.2245 | 18.0 | 53586 | 0.0919 | 3981 | 121522 | 3944 | 1.1690 | 0.2012 | 22678 | 0.2065 | 0.3273 | 0.6727 |
1.2 | 19.0 | 56563 | 0.0903 | 3819 | 122029 | 3995 | 1.1626 | 0.1981 | 22333 | 0.2034 | 0.3226 | 0.6774 |
1.1964 | 20.0 | 59540 | 0.0916 | 3822 | 122170 | 4154 | 1.1598 | 0.1980 | 22189 | 0.2036 | 0.3218 | 0.6782 |
1.1822 | 21.0 | 62517 | 0.0871 | 3630 | 122825 | 3981 | 1.1471 | 0.1928 | 21726 | 0.1980 | 0.3146 | 0.6854 |
1.1758 | 22.0 | 65494 | 0.0862 | 3556 | 123114 | 3918 | 1.1413 | 0.1906 | 21511 | 0.1956 | 0.3114 | 0.6886 |
1.1735 | 23.0 | 68471 | 0.0847 | 3431 | 123623 | 4013 | 1.1381 | 0.1877 | 21127 | 0.1928 | 0.3067 | 0.6933 |
1.1556 | 24.0 | 71448 | 0.0839 | 3668 | 123854 | 3698 | 1.1282 | 0.1845 | 20659 | 0.1891 | 0.3015 | 0.6985 |
1.1538 | 25.0 | 74425 | 0.0819 | 3475 | 124201 | 3716 | 1.1240 | 0.1823 | 20505 | 0.1869 | 0.2986 | 0.7014 |
1.1078 | 26.0 | 77402 | 0.0819 | 3410 | 124426 | 3751 | 1.1259 | 0.1810 | 20345 | 0.1856 | 0.2965 | 0.7035 |
1.1539 | 27.0 | 80379 | 0.0805 | 3333 | 124879 | 3716 | 1.1152 | 0.1779 | 19969 | 0.1823 | 0.2916 | 0.7084 |
1.1432 | 54.0 | 80406 | 1.1113 | 0.0787 | 0.1790 | 0.1747 | 0.2868 | 0.7132 | 125277 | 19604 | 3300 | 3619 |
1.1171 | 55.0 | 81895 | 1.0912 | 0.0744 | 0.1713 | 0.1676 | 0.2763 | 0.7237 | 126048 | 18870 | 3263 | 3245 |
1.1027 | 56.0 | 83384 | 1.0874 | 0.0740 | 0.1696 | 0.1659 | 0.2736 | 0.7264 | 126362 | 18663 | 3156 | 3309 |
1.0827 | 57.0 | 84873 | 1.0865 | 0.0725 | 0.1690 | 0.1654 | 0.2728 | 0.7272 | 126356 | 18599 | 3226 | 3214 |
1.0794 | 58.0 | 86362 | 1.0837 | 0.0717 | 0.1665 | 0.1629 | 0.2691 | 0.7309 | 126790 | 18361 | 3030 | 3287 |
1.0585 | 59.0 | 87851 | 1.0816 | 0.0710 | 0.1664 | 0.1629 | 0.2688 | 0.7312 | 126738 | 18285 | 3158 | 3218 |
1.0549 | 60.0 | 89340 | 1.0785 | 0.0707 | 0.1651 | 0.1616 | 0.2671 | 0.7329 | 126913 | 18198 | 3070 | 3195 |
1.0708 | 61.0 | 90829 | 1.0795 | 0.0704 | 0.1649 | 0.1614 | 0.2667 | 0.7333 | 126928 | 18157 | 3096 | 3178 |
1.0674 | 62.0 | 92318 | 1.0767 | 0.0699 | 0.1638 | 0.1605 | 0.2650 | 0.7350 | 126981 | 17994 | 3206 | 3071 |
1.0709 | 63.0 | 93807 | 1.0738 | 0.0699 | 0.1638 | 0.1605 | 0.2652 | 0.7348 | 126999 | 18030 | 3152 | 3096 |
1.0672 | 64.0 | 95296 | 1.0734 | 0.0687 | 0.1622 | 0.1588 | 0.2630 | 0.7370 | 127257 | 17925 | 2999 | 3105 |
1.0716 | 65.0 | 96785 | 1.0712 | 0.0685 | 0.1610 | 0.1577 | 0.2613 | 0.7387 | 127412 | 17804 | 2965 | 3082 |
1.0664 | 66.0 | 98274 | 1.0723 | 0.0686 | 0.1613 | 0.1581 | 0.2618 | 0.7382 | 127312 | 17817 | 3052 | 3039 |
1.0452 | 67.0 | 99763 | 1.0703 | 0.0681 | 0.1605 | 0.1572 | 0.2605 | 0.7395 | 127444 | 17742 | 2995 | 3041 |
1.0318 | 68.0 | 101252 | 1.0695 | 0.0679 | 0.1603 | 0.1571 | 0.2601 | 0.7399 | 127479 | 17693 | 3009 | 3049 |
1.0341 | 69.0 | 102741 | 1.0686 | 0.0677 | 0.1597 | 0.1565 | 0.2590 | 0.7410 | 127600 | 17599 | 2982 | 3088 |
1.0338 | 70.0 | 104230 | 1.0688 | 0.0677 | 0.1598 | 0.1565 | 0.2593 | 0.7407 | 127573 | 17637 | 2971 | 3069 |
Framework versions
- Transformers 4.40.0.dev0
- Pytorch 2.2.0+rocm5.6
- Datasets 2.18.0
- Tokenizers 0.15.2
Wandb run
https://wandb.ai/butspeechfit/decred_commonvoice_en/runs/ED_small_cv_v2_continue3
- Downloads last month
- 0