Mahwiz Khalil commited on
Commit
f082d0b
·
1 Parent(s): 52250e7

End of training

Browse files
README.md ADDED
@@ -0,0 +1,83 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - skr
4
+ license: cc-by-nc-4.0
5
+ base_model: facebook/mms-300m
6
+ tags:
7
+ - skr
8
+ - generated_from_trainer
9
+ datasets:
10
+ - mozilla-foundation/common_voice_11_0
11
+ metrics:
12
+ - wer
13
+ model-index:
14
+ - name: Saraiki Speech
15
+ results: []
16
+ ---
17
+
18
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
19
+ should probably proofread and complete it, then remove this comment. -->
20
+
21
+ # Saraiki Speech
22
+
23
+ This model is a fine-tuned version of [facebook/mms-300m](https://huggingface.co/facebook/mms-300m) on the common_voice dataset.
24
+ It achieves the following results on the evaluation set:
25
+ - Loss: 3.4467
26
+ - Wer: 1.0
27
+
28
+ ## Model description
29
+
30
+ More information needed
31
+
32
+ ## Intended uses & limitations
33
+
34
+ More information needed
35
+
36
+ ## Training and evaluation data
37
+
38
+ More information needed
39
+
40
+ ## Training procedure
41
+
42
+ ### Training hyperparameters
43
+
44
+ The following hyperparameters were used during training:
45
+ - learning_rate: 0.001
46
+ - train_batch_size: 8
47
+ - eval_batch_size: 8
48
+ - seed: 42
49
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
+ - lr_scheduler_type: linear
51
+ - lr_scheduler_warmup_steps: 100
52
+ - num_epochs: 10
53
+ - mixed_precision_training: Native AMP
54
+
55
+ ### Training results
56
+
57
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
58
+ |:-------------:|:-----:|:----:|:---------------:|:---:|
59
+ | 3.5576 | 0.58 | 100 | 3.5468 | 1.0 |
60
+ | 3.2271 | 1.16 | 200 | 3.6465 | 1.0 |
61
+ | 3.3492 | 1.73 | 300 | 3.8570 | 1.0 |
62
+ | 3.4442 | 2.31 | 400 | 3.7065 | 1.0 |
63
+ | 3.29 | 2.89 | 500 | 3.5289 | 1.0 |
64
+ | 3.2951 | 3.47 | 600 | 3.7043 | 1.0 |
65
+ | 3.2919 | 4.05 | 700 | 3.6748 | 1.0 |
66
+ | 3.301 | 4.62 | 800 | 3.4422 | 1.0 |
67
+ | 3.2103 | 5.2 | 900 | 3.4955 | 1.0 |
68
+ | 3.2728 | 5.78 | 1000 | 3.6059 | 1.0 |
69
+ | 3.2458 | 6.36 | 1100 | 3.4087 | 1.0 |
70
+ | 3.244 | 6.94 | 1200 | 3.4352 | 1.0 |
71
+ | 3.2562 | 7.51 | 1300 | 3.4648 | 1.0 |
72
+ | 3.2116 | 8.09 | 1400 | 3.4618 | 1.0 |
73
+ | 3.2268 | 8.67 | 1500 | 3.4313 | 1.0 |
74
+ | 3.2387 | 9.25 | 1600 | 3.4246 | 1.0 |
75
+ | 3.1921 | 9.83 | 1700 | 3.4467 | 1.0 |
76
+
77
+
78
+ ### Framework versions
79
+
80
+ - Transformers 4.36.0.dev0
81
+ - Pytorch 2.1.0+cu118
82
+ - Datasets 2.15.0
83
+ - Tokenizers 0.15.0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b91ccc52ea9e5060768c5ddfa721efa9913ee7f1cd1577bbb93c80dc9f5239f1
3
  size 1262061680
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7913bcc77118ef21167e8ff6b25879660bc6c927c710b671b2789460d06c6c22
3
  size 1262061680
runs/Nov24_12-49-50_890b5f3886a6/events.out.tfevents.1700830237.890b5f3886a6.246.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:75eb5bb61df85d00a6cef739a5ed1be3ce323b8a123b7e0677befb3d17a42505
3
- size 13392
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76449211351f1bbf428a749b4aac37620eec29dccecca79e7ddb42450a8d6d71
3
+ size 14221