sanchit-gandhi HF staff commited on
Commit
e076bda
1 Parent(s): d6442f8

update model card README.md

Browse files
README.md ADDED
@@ -0,0 +1,72 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ datasets:
5
+ - librispeech_asr
6
+ model-index:
7
+ - name: ''
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ #
15
+
16
+ This model was trained from scratch on the librispeech_asr dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.8384
19
+ - Wer: 0.1367
20
+
21
+ ## Model description
22
+
23
+ More information needed
24
+
25
+ ## Intended uses & limitations
26
+
27
+ More information needed
28
+
29
+ ## Training and evaluation data
30
+
31
+ More information needed
32
+
33
+ ## Training procedure
34
+
35
+ ### Training hyperparameters
36
+
37
+ The following hyperparameters were used during training:
38
+ - learning_rate: 3e-05
39
+ - train_batch_size: 8
40
+ - eval_batch_size: 8
41
+ - seed: 42
42
+ - gradient_accumulation_steps: 4
43
+ - total_train_batch_size: 32
44
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
+ - lr_scheduler_type: linear
46
+ - lr_scheduler_warmup_steps: 1000
47
+ - num_epochs: 20.0
48
+ - mixed_precision_training: Native AMP
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Wer |
53
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|
54
+ | 6.2245 | 1.68 | 1500 | 6.1442 | 1.5986 |
55
+ | 5.4521 | 3.36 | 3000 | 5.4335 | 1.6439 |
56
+ | 3.3659 | 5.04 | 4500 | 3.6455 | 0.6503 |
57
+ | 1.5724 | 6.73 | 6000 | 2.3554 | 0.3386 |
58
+ | 1.4759 | 8.41 | 7500 | 1.7423 | 0.2889 |
59
+ | 1.0826 | 10.09 | 9000 | 1.3818 | 0.2209 |
60
+ | 0.6769 | 11.77 | 10500 | 1.1268 | 0.1737 |
61
+ | 0.7348 | 13.45 | 12000 | 0.9990 | 0.1575 |
62
+ | 0.5419 | 15.13 | 13500 | 0.9435 | 0.1560 |
63
+ | 0.4212 | 16.82 | 15000 | 0.8678 | 0.1405 |
64
+ | 0.3805 | 18.5 | 16500 | 0.8384 | 0.1367 |
65
+
66
+
67
+ ### Framework versions
68
+
69
+ - Transformers 4.17.0.dev0
70
+ - Pytorch 1.10.2+cu113
71
+ - Datasets 1.18.3
72
+ - Tokenizers 0.11.0
wandb/run-20220315_195757-3ex43zbl/files/output.log CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b4096de07ce343cf58ce0d906e04c00737af2f9fe745857c70aefc5be3d8c038
3
- size 31631592
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39a56005e56bf40e540a1f88c162ca419a22be0951f9fc73c5ac374bfabcfe38
3
+ size 31670454
wandb/run-20220315_195757-3ex43zbl/logs/debug-internal.log CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8f582d8b529ace88557387ad1ad4b146bc9ca985a6e833e3c5e60c66881dca0d
3
- size 31282082
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e8db38a40fa21007b2528a29d3342416278a348d13ba23dc21db5a116110ab7
3
+ size 31303520
wandb/run-20220315_195757-3ex43zbl/run-3ex43zbl.wandb CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1a682c4ea0a4d1acd5a990374694bc82aaf0598e6002d5f945e59281a2b38cb1
3
- size 1289420917
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3dedc3cec1efde3165341d4e47bc43602e8b2c5caf1bd828d569fdd41b6bcb8d
3
+ size 1289494265