DrishtiSharma committed on
Commit
07926a3
1 Parent(s): 3579ac6

Create README.md

  1. README.md +97 -0
README.md ADDED
@@ -0,0 +1,97 @@
+ ---
+ language:
+ - sl
+ license: apache-2.0
+ tags:
+ - automatic-speech-recognition
+ - mozilla-foundation/common_voice_8_0
+ - generated_from_trainer
+ - sl
+ - robust-speech-event
+ - model_for_talk
+ datasets:
+ - common_voice
+ model-index:
+ - name: wav2vec2-large-xls-r-300m-sl-with-LM-v1
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Common Voice 8
+       type: mozilla-foundation/common_voice_8_0
+       args: sl
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 0.20626555409164105
+     - name: Test CER
+       type: cer
+       value: 0.051648321634392154
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # wav2vec2-large-xls-r-300m-sl-with-LM-v1
+
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the Slovenian (`sl`) subset of the mozilla-foundation/common_voice_8_0 dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.2756
+ - Wer: 0.2279
+
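The `Wer` value reported above is a word error rate: the word-level edit distance between the model's transcript and the reference, divided by the number of reference words. The character error rate (CER) in the card metadata is the same ratio computed over characters. The following is a minimal sketch of both, assuming whitespace tokenization and no text normalization; the helper names `edit_distance`, `wer`, and `cer` and the sample sentences are illustrative and are not taken from the training or evaluation scripts.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, keeping one DP row at a time."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits divided by reference word count."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits divided by reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)


if __name__ == "__main__":
    # Hypothetical sample pair: one wrong word out of three.
    print(wer("dober dan vsem", "dober den vsem"))
    print(cer("dober dan vsem", "dober den vsem"))
```

Because one misspelled word counts as a full word error but only one character error, CER is typically much lower than WER, which matches the relationship between the two test metrics in the metadata.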
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 7.1e-05
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 1000
+ - num_epochs: 100.0
+ - mixed_precision_training: Native AMP
+
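To make the `linear` scheduler with 1000 warmup steps concrete, here is a small sketch of how such a schedule shapes the learning rate: it ramps linearly from zero to the peak over the warmup steps, then decays linearly back to zero at the end of training. The total of 8200 steps is an assumption, inferred from the training log in this card (step 500 falls at epoch 6.1, so roughly 82 steps per epoch over 100 epochs); the function name `linear_lr` is illustrative.

```python
BASE_LR = 7.1e-05     # learning_rate from the list above
WARMUP_STEPS = 1000   # lr_scheduler_warmup_steps from the list above
TOTAL_STEPS = 8200    # assumption: about 82 steps per epoch x 100 epochs


def linear_lr(step, base_lr=BASE_LR, warmup=WARMUP_STEPS, total=TOTAL_STEPS):
    """Learning rate at `step` for linear warmup followed by linear decay."""
    if step < warmup:
        # Warmup phase: scale up proportionally to the step count.
        return base_lr * step / max(1, warmup)
    # Decay phase: scale down linearly, reaching zero at the final step.
    return base_lr * max(0.0, (total - step) / max(1, total - warmup))


if __name__ == "__main__":
    for step in (0, 500, 1000, 4600, 8200):
        print(step, linear_lr(step))
```

Under these assumptions the first two logged evaluations (steps 500 and 1000) both fall inside the warmup phase, which is consistent with the WER staying near 1.0 that early in training.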
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Wer    |
+ |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | 3.3881        | 6.1   | 500  | 2.9710          | 1.0    |
+ | 2.6401        | 12.2  | 1000 | 1.7677          | 0.9734 |
+ | 1.5152        | 18.29 | 1500 | 0.5564          | 0.6011 |
+ | 1.2191        | 24.39 | 2000 | 0.4319          | 0.4390 |
+ | 1.0237        | 30.49 | 2500 | 0.3141          | 0.3175 |
+ | 0.8892        | 36.59 | 3000 | 0.2748          | 0.2689 |
+ | 0.8296        | 42.68 | 3500 | 0.2680          | 0.2534 |
+ | 0.7602        | 48.78 | 4000 | 0.2820          | 0.2506 |
+ | 0.7186        | 54.88 | 4500 | 0.2672          | 0.2398 |
+ | 0.6887        | 60.98 | 5000 | 0.2729          | 0.2402 |
+ | 0.6507        | 67.07 | 5500 | 0.2767          | 0.2361 |
+ | 0.6226        | 73.17 | 6000 | 0.2817          | 0.2332 |
+ | 0.6024        | 79.27 | 6500 | 0.2679          | 0.2279 |
+ | 0.5787        | 85.37 | 7000 | 0.2837          | 0.2316 |
+ | 0.5744        | 91.46 | 7500 | 0.2838          | 0.2284 |
+ | 0.5556        | 97.56 | 8000 | 0.2763          | 0.2281 |
+
+
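One way to read the log above is to ask which evaluation step scored best on each metric. The sketch below copies the step, validation loss, and WER columns from the table and picks the minimum of each; the variable names are illustrative, and this is not a statement about which checkpoint was actually published.

```python
# (step, validation_loss, wer) copied from the training results table
LOG = [
    (500, 2.9710, 1.0),
    (1000, 1.7677, 0.9734),
    (1500, 0.5564, 0.6011),
    (2000, 0.4319, 0.4390),
    (2500, 0.3141, 0.3175),
    (3000, 0.2748, 0.2689),
    (3500, 0.2680, 0.2534),
    (4000, 0.2820, 0.2506),
    (4500, 0.2672, 0.2398),
    (5000, 0.2729, 0.2402),
    (5500, 0.2767, 0.2361),
    (6000, 0.2817, 0.2332),
    (6500, 0.2679, 0.2279),
    (7000, 0.2837, 0.2316),
    (7500, 0.2838, 0.2284),
    (8000, 0.2763, 0.2281),
]

# Evaluation step with the lowest word error rate.
best_by_wer = min(LOG, key=lambda row: row[2])
# Evaluation step with the lowest validation loss.
best_by_loss = min(LOG, key=lambda row: row[1])

if __name__ == "__main__":
    print("lowest WER:", best_by_wer)
    print("lowest validation loss:", best_by_loss)
```

The two criteria disagree here: validation loss is lowest at step 4500, while WER keeps improving until step 6500, so the choice of selection metric matters when picking a checkpoint.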
+ ### Framework versions
+
+ - Transformers 4.17.0.dev0
+ - PyTorch 1.10.2+cu102
+ - Datasets 1.18.2.dev0
+ - Tokenizers 0.11.0