AndrewMcDowell commited on
Commit
433d77c
1 Parent(s): b566783

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -16
README.md CHANGED
@@ -1,10 +1,6 @@
1
  ---
2
- language:
3
- - ar
4
  license: apache-2.0
5
  tags:
6
- - automatic-speech-recognition
7
- - mozilla-foundation/common_voice_8_0
8
  - generated_from_trainer
9
  datasets:
10
  - common_voice
@@ -18,10 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  #
20
 
21
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - AR dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.9694
24
- - Wer: 0.7824
25
 
26
  ## Model description
27
 
@@ -40,15 +36,15 @@ More information needed
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
- - learning_rate: 0.00015
44
  - train_batch_size: 32
45
  - eval_batch_size: 8
46
  - seed: 42
47
- - gradient_accumulation_steps: 4
48
- - total_train_batch_size: 128
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
51
- - lr_scheduler_warmup_steps: 2000
52
  - num_epochs: 10.0
53
  - mixed_precision_training: Native AMP
54
 
@@ -56,11 +52,17 @@ The following hyperparameters were used during training:
56
 
57
  | Training Loss | Epoch | Step | Validation Loss | Wer |
58
  |:-------------:|:-----:|:----:|:---------------:|:------:|
59
- | 2.0846 | 1.68 | 500 | 1.1641 | 0.8072 |
60
- | 2.1201 | 3.35 | 1000 | 1.1776 | 0.8329 |
61
- | 2.1972 | 5.03 | 1500 | 1.2632 | 0.8724 |
62
- | 2.2643 | 6.71 | 2000 | 1.3723 | 0.8983 |
63
- | 2.1649 | 8.39 | 2500 | 1.2550 | 0.8842 |
 
 
 
 
 
 
64
 
65
 
66
  ### Framework versions
 
1
  ---
 
 
2
  license: apache-2.0
3
  tags:
 
 
4
  - generated_from_trainer
5
  datasets:
6
  - common_voice
 
14
 
15
  #
16
 
17
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the common_voice dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.5476
20
+ - Wer: 0.9696
21
 
22
  ## Model description
23
 
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - learning_rate: 6.5e-05
40
  - train_batch_size: 32
41
  - eval_batch_size: 8
42
  - seed: 42
43
+ - gradient_accumulation_steps: 2
44
+ - total_train_batch_size: 64
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
+ - lr_scheduler_warmup_steps: 500
48
  - num_epochs: 10.0
49
  - mixed_precision_training: Native AMP
50
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Wer |
54
  |:-------------:|:-----:|:----:|:---------------:|:------:|
55
+ | 2.6638 | 0.84 | 500 | 2.3852 | 0.9974 |
56
+ | 2.6578 | 1.67 | 1000 | 2.2796 | 0.9971 |
57
+ | 2.6016 | 2.51 | 1500 | 2.0046 | 0.9961 |
58
+ | 2.5752 | 3.35 | 2000 | 1.9606 | 0.9961 |
59
+ | 2.539 | 4.19 | 2500 | 1.8836 | 0.9940 |
60
+ | 2.5214 | 5.03 | 3000 | 1.8593 | 0.9933 |
61
+ | 2.4684 | 5.86 | 3500 | 1.7816 | 0.9885 |
62
+ | 2.4134 | 6.7 | 4000 | 1.7168 | 0.9808 |
63
+ | 2.3732 | 7.54 | 4500 | 1.6406 | 0.9764 |
64
+ | 2.3371 | 8.37 | 5000 | 1.6087 | 0.9739 |
65
+ | 2.2824 | 9.21 | 5500 | 1.5476 | 0.9696 |
66
 
67
 
68
  ### Framework versions