Neruzo commited on
Commit
1494e51
1 Parent(s): 34c19bc

End of training

Browse files
Files changed (2) hide show
  1. README.md +13 -16
  2. generation_config.json +1 -1
README.md CHANGED
@@ -1,12 +1,12 @@
1
  ---
2
  language:
3
- - hi
4
  license: apache-2.0
5
  base_model: openai/whisper-small
6
  tags:
7
  - generated_from_trainer
8
  datasets:
9
- - mozilla-foundation/common_voice_11_0
10
  metrics:
11
  - wer
12
  model-index:
@@ -16,15 +16,13 @@ model-index:
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
- name: Common Voice 11.0
20
- type: mozilla-foundation/common_voice_11_0
21
- config: hi
22
- split: None
23
- args: 'config: hi, split: test'
24
  metrics:
25
  - name: Wer
26
  type: wer
27
- value: 56.36163548632862
28
  ---
29
 
30
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,10 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  # Whisper Small Hi - Sanchit Gandhi
34
 
35
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
36
  It achieves the following results on the evaluation set:
37
- - Loss: 0.5549
38
- - Wer: 56.3616
39
 
40
  ## Model description
41
 
@@ -55,21 +53,20 @@ More information needed
55
 
56
  The following hyperparameters were used during training:
57
  - learning_rate: 1e-05
58
- - train_batch_size: 8
59
- - eval_batch_size: 8
60
  - seed: 42
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 5
64
- - training_steps: 20
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
71
- | 0.2754 | 0.01 | 10 | 0.5915 | 58.5541 |
72
- | 0.4426 | 0.02 | 20 | 0.5549 | 56.3616 |
73
 
74
 
75
  ### Framework versions
 
1
  ---
2
  language:
3
+ - vi
4
  license: apache-2.0
5
  base_model: openai/whisper-small
6
  tags:
7
  - generated_from_trainer
8
  datasets:
9
+ - vivos
10
  metrics:
11
  - wer
12
  model-index:
 
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
+ name: '##'
20
+ type: vivos
21
+ args: 'config: vi, split: test'
 
 
22
  metrics:
23
  - name: Wer
24
  type: wer
25
+ value: 97.2027972027972
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
30
 
31
  # Whisper Small Hi - Sanchit Gandhi
32
 
33
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the ## dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 1.3333
36
+ - Wer: 97.2028
37
 
38
  ## Model description
39
 
 
53
 
54
  The following hyperparameters were used during training:
55
  - learning_rate: 1e-05
56
+ - train_batch_size: 16
57
+ - eval_batch_size: 2
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
  - lr_scheduler_warmup_steps: 5
62
+ - training_steps: 10
63
  - mixed_precision_training: Native AMP
64
 
65
  ### Training results
66
 
67
  | Training Loss | Epoch | Step | Validation Loss | Wer |
68
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
69
+ | 1.6611 | 0.01 | 10 | 1.3333 | 97.2028 |
 
70
 
71
 
72
  ### Framework versions
generation_config.json CHANGED
@@ -150,7 +150,7 @@
150
  "<|yo|>": 50325,
151
  "<|zh|>": 50260
152
  },
153
- "language": "hindi",
154
  "max_initial_timestamp_index": 50,
155
  "max_length": 448,
156
  "no_timestamps_token_id": 50363,
 
150
  "<|yo|>": 50325,
151
  "<|zh|>": 50260
152
  },
153
+ "language": "vietnamese",
154
  "max_initial_timestamp_index": 50,
155
  "max_length": 448,
156
  "no_timestamps_token_id": 50363,