DrishtiSharma committed on
Commit
0a52ef5
1 Parent(s): 736e004

Create README.md

Files changed (1)
  1. README.md +97 -0
README.md ADDED
@@ -0,0 +1,97 @@
+ ---
+ language:
+ - sl
+ license: apache-2.0
+ tags:
+ - automatic-speech-recognition
+ - mozilla-foundation/common_voice_8_0
+ - generated_from_trainer
+ - sl
+ - robust-speech-event
+ - model_for_talk
+ datasets:
+ - common_voice
+ model-index:
+ - name: wav2vec2-large-xls-r-300m-sl-with-LM-v2
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Common Voice 8
+       type: mozilla-foundation/common_voice_8_0
+       args: sl
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 0.21695212999560826
+     - name: Test CER
+       type: cer
+       value: 0.052850080572474256
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # wav2vec2-large-xls-r-300m-sl-with-LM-v2
+
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the Slovenian (`sl`) subset of the mozilla-foundation/common_voice_8_0 dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.2855
+ - WER: 0.2401
+
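The WER reported above is the word error rate: the word-level edit distance between the model's transcript and the reference transcript, divided by the number of reference words. The card does not say which implementation produced the figure, so the following is only a minimal pure-Python sketch of the metric. The function name `word_error_rate` and the example sentences are illustrative assumptions, not part of the original training code.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Illustrative strings only, not taken from Common Voice.
    print(word_error_rate("dober dan vsem skupaj", "dober dan vsem"))
```

Corpus-level scores are usually obtained by summing errors and reference words over all utterances rather than averaging per-utterance rates, so this helper would need to be applied to accumulated counts to approximate a figure like the one above.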
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 7e-05
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 1000
+ - num_epochs: 100.0
+ - mixed_precision_training: Native AMP
+
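To make the scheduler settings above concrete, here is a small sketch of a linear schedule with warmup: the learning rate rises linearly from 0 to the peak over the warmup steps, then decays linearly to 0. It assumes the common warmup-then-decay form of a `linear` scheduler. The total step count of 8200 is an estimate (about 82 steps per epoch over 100 epochs, inferred from the training results table) and is not stated in the card; `linear_lr` is a hypothetical helper name.

```python
PEAK_LR = 7e-05       # learning_rate from the list above
WARMUP_STEPS = 1000   # lr_scheduler_warmup_steps from the list above
TOTAL_STEPS = 8200    # ASSUMPTION: ~82 steps/epoch x 100 epochs, not stated in the card


def linear_lr(
    step: int,
    peak_lr: float = PEAK_LR,
    warmup_steps: int = WARMUP_STEPS,
    total_steps: int = TOTAL_STEPS,
) -> float:
    """Learning rate at `step` for linear warmup followed by linear decay."""
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to the peak.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: scale linearly from the peak down to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 500, 1000, 4600, 8200):
        print(step, linear_lr(step))
```

Under these assumptions the rate peaks at step 1000 and is back to half the peak around step 4600.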
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | WER    |
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | 6.9294        | 6.1   | 500  | 2.9712          | 1.0    |
+ | 2.8305        | 12.2  | 1000 | 1.7073          | 0.9479 |
+ | 1.4795        | 18.29 | 1500 | 0.5756          | 0.6397 |
+ | 1.3433        | 24.39 | 2000 | 0.4968          | 0.5424 |
+ | 1.1766        | 30.49 | 2500 | 0.4185          | 0.4743 |
+ | 1.0017        | 36.59 | 3000 | 0.3303          | 0.3578 |
+ | 0.9358        | 42.68 | 3500 | 0.3003          | 0.3051 |
+ | 0.8358        | 48.78 | 4000 | 0.3045          | 0.2884 |
+ | 0.7647        | 54.88 | 4500 | 0.2866          | 0.2677 |
+ | 0.7482        | 60.98 | 5000 | 0.2829          | 0.2585 |
+ | 0.6943        | 67.07 | 5500 | 0.2782          | 0.2478 |
+ | 0.6586        | 73.17 | 6000 | 0.2911          | 0.2537 |
+ | 0.6425        | 79.27 | 6500 | 0.2817          | 0.2462 |
+ | 0.6067        | 85.37 | 7000 | 0.2910          | 0.2436 |
+ | 0.5974        | 91.46 | 7500 | 0.2875          | 0.2430 |
+ | 0.5812        | 97.56 | 8000 | 0.2852          | 0.2396 |
+
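A few quantities can be read off the table above with simple arithmetic. The sketch below copies the rows into a list, estimates the number of optimizer steps per epoch, and finds the logged checkpoints with the lowest validation loss and the lowest word error rate. The names `Row`, `ROWS`, and `steps_per_epoch` are illustrative. The implied training-set size additionally assumes a single device and no gradient accumulation, neither of which the card states.

```python
from typing import NamedTuple


class Row(NamedTuple):
    training_loss: float
    epoch: float
    step: int
    validation_loss: float
    wer: float


# Values copied from the training results table above.
ROWS = [
    Row(6.9294, 6.1, 500, 2.9712, 1.0),
    Row(2.8305, 12.2, 1000, 1.7073, 0.9479),
    Row(1.4795, 18.29, 1500, 0.5756, 0.6397),
    Row(1.3433, 24.39, 2000, 0.4968, 0.5424),
    Row(1.1766, 30.49, 2500, 0.4185, 0.4743),
    Row(1.0017, 36.59, 3000, 0.3303, 0.3578),
    Row(0.9358, 42.68, 3500, 0.3003, 0.3051),
    Row(0.8358, 48.78, 4000, 0.3045, 0.2884),
    Row(0.7647, 54.88, 4500, 0.2866, 0.2677),
    Row(0.7482, 60.98, 5000, 0.2829, 0.2585),
    Row(0.6943, 67.07, 5500, 0.2782, 0.2478),
    Row(0.6586, 73.17, 6000, 0.2911, 0.2537),
    Row(0.6425, 79.27, 6500, 0.2817, 0.2462),
    Row(0.6067, 85.37, 7000, 0.2910, 0.2436),
    Row(0.5974, 91.46, 7500, 0.2875, 0.2430),
    Row(0.5812, 97.56, 8000, 0.2852, 0.2396),
]


def steps_per_epoch(rows: list[Row]) -> float:
    """Estimate optimizer steps per epoch from the last logged row."""
    last = rows[-1]
    return last.step / last.epoch


best_by_wer = min(ROWS, key=lambda r: r.wer)
best_by_loss = min(ROWS, key=lambda r: r.validation_loss)

if __name__ == "__main__":
    per_epoch = round(steps_per_epoch(ROWS))
    print("steps per epoch (approx.):", per_epoch)
    # ASSUMPTION: effective batch size equals train_batch_size (32).
    print("implied training examples (approx.):", per_epoch * 32)
    print("lowest WER at step:", best_by_wer.step)
    print("lowest validation loss at step:", best_by_loss.step)
```

In this log the lowest validation loss and the lowest WER fall on different checkpoints, which is worth keeping in mind when choosing a checkpoint by one metric or the other.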
+ ### Framework versions
+
+ - Transformers 4.17.0.dev0
+ - PyTorch 1.10.2+cu102
+ - Datasets 1.18.2.dev0
+ - Tokenizers 0.11.0
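
When trying to reproduce results, it can help to compare a local environment against the versions listed above. The sketch below uses only the standard library. The mapping from the card's names to PyPI distribution names (in particular PyTorch to `torch`) is an assumption, and `compare_versions` is a hypothetical helper. Exact string matching is deliberately strict, so a different local build tag or a released version in place of a `.dev0` build will be reported as a mismatch.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in the card, keyed by PyPI distribution name.
# ASSUMPTION: "PyTorch" corresponds to the "torch" distribution.
CARD_VERSIONS = {
    "transformers": "4.17.0.dev0",
    "torch": "1.10.2+cu102",
    "datasets": "1.18.2.dev0",
    "tokenizers": "0.11.0",
}


def compare_versions(expected: dict[str, str] = CARD_VERSIONS) -> dict[str, tuple]:
    """Return {name: (wanted, installed_or_None, exact_match)} for each package."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[name] = (wanted, installed, installed == wanted)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, matches) in compare_versions().items():
        status = "ok" if matches else "differs"
        print(f"{name}: card={wanted} installed={installed} ({status})")
```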