Update README.md
README.md
@@ -21,3 +21,37 @@ model-index:
 type: wer
 value: 0.0655
 ---
+
+# wav2vec2-bert-uk
+
+- Join our Speech Recognition Group in Telegram: https://t.me/speech_recognition_uk
+- Join our **Discord server** - https://discord.gg/nmUCXz55 - where we're talking about AI
+
+This model was trained on 2 RTX A4000 GPUs with the following hyperparameters:
+
+```
+torchrun --standalone --nnodes=1 --nproc-per-node=2 ../train_w2v2_bert.py \
+--custom_set ~/cv10/train.csv \
+--custom_set_eval ~/cv10/test.csv \
+--num_train_epochs 15 \
+--tokenize_config . \
+--w2v2_bert_model facebook/w2v-bert-2.0 \
+--batch 4 \
+--num_proc 5 \
+--grad_accum 1 \
+--learning_rate 3e-5 \
+--logging_steps 20 \
+--eval_step 500 \
+--group_by_length \
+--attention_dropout 0.0 \
+--activation_dropout 0.05 \
+--feat_proj_dropout 0.05 \
+--feat_quantizer_dropout 0.0 \
+--hidden_dropout 0.05 \
+--layerdrop 0.0 \
+--final_dropout 0.0 \
+--mask_time_prob 0.0 \
+--mask_time_length 10 \
+--mask_feature_prob 0.0 \
+--mask_feature_length 10
+```
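To make the scale of the run above easier to read, the following is a minimal sketch of how a few of those flags might be parsed and combined into an effective batch size. The source of `train_w2v2_bert.py` is not shown here, so `build_parser` and `effective_batch_size` are hypothetical helpers, and the sketch assumes that `--batch` is a per-process batch size and that `--group_by_length` is a plain boolean switch.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring a subset of the flags in the command.

    The real train_w2v2_bert.py may define these arguments differently.
    """
    parser = argparse.ArgumentParser()
    parser.add_argument("--batch", type=int, default=4)
    parser.add_argument("--grad_accum", type=int, default=1)
    parser.add_argument("--learning_rate", type=float, default=3e-5)
    parser.add_argument("--num_train_epochs", type=int, default=15)
    # Assumption: a switch with no value, as it appears in the command.
    parser.add_argument("--group_by_length", action="store_true")
    return parser


def effective_batch_size(
    nproc_per_node: int, batch: int, grad_accum: int, nnodes: int = 1
) -> int:
    """Samples contributing to one optimizer step across all processes.

    Assumes `batch` is the per-process batch size.
    """
    return nnodes * nproc_per_node * batch * grad_accum


# Parse an explicit argument list instead of sys.argv so the sketch is runnable anywhere.
args = build_parser().parse_args(
    ["--batch", "4", "--grad_accum", "1", "--learning_rate", "3e-5", "--group_by_length"]
)

# --nnodes=1 and --nproc-per-node=2 come from the torchrun invocation.
total = effective_batch_size(nproc_per_node=2, batch=args.batch, grad_accum=args.grad_accum)
print(total)
```

Under these assumptions, each optimizer step sees 2 × 4 × 1 = 8 samples. Raising `--grad_accum` would be one way to increase that number without using more GPU memory per process.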