kimbochen commited on
Commit
b29e3e1
1 Parent(s): 3d5c12d

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -19
README.md CHANGED
@@ -1,41 +1,38 @@
1
  ---
2
- language:
3
- - zh
4
  license: apache-2.0
5
  tags:
6
- - whisper-event
7
  - generated_from_trainer
8
  datasets:
9
- - mozilla-foundation/common_voice_11_0
10
  metrics:
11
  - wer
12
  model-index:
13
- - name: Whisper Small Chinese - Kimbo Chen
14
  results:
15
  - task:
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
- name: Common Voice 11.0
20
- type: mozilla-foundation/common_voice_11_0
21
  config: zh-TW
22
  split: test
23
  args: zh-TW
24
  metrics:
25
  - name: Wer
26
  type: wer
27
- value: 40.81883316274309
28
  ---
29
 
30
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
31
  should probably proofread and complete it, then remove this comment. -->
32
 
33
- # Whisper Small Chinese - Kimbo Chen
34
 
35
- This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
36
  It achieves the following results on the evaluation set:
37
- - Loss: 0.1984
38
- - Wer: 40.8188
39
 
40
  ## Model description
41
 
@@ -56,11 +53,11 @@ More information needed
56
  The following hyperparameters were used during training:
57
  - learning_rate: 1e-05
58
  - train_batch_size: 64
59
- - eval_batch_size: 8
60
  - seed: 42
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
- - lr_scheduler_warmup_steps: 100
64
  - training_steps: 1000
65
  - mixed_precision_training: Native AMP
66
 
@@ -68,11 +65,11 @@ The following hyperparameters were used during training:
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
71
- | 0.1438 | 1.05 | 200 | 0.1822 | 42.4360 |
72
- | 0.0315 | 2.1 | 400 | 0.1869 | 42.1290 |
73
- | 0.0113 | 4.01 | 600 | 0.1953 | 40.6346 |
74
- | 0.0053 | 5.06 | 800 | 0.1950 | 40.6755 |
75
- | 0.0035 | 6.11 | 1000 | 0.1984 | 40.8188 |
76
 
77
 
78
  ### Framework versions
 
1
  ---
 
 
2
  license: apache-2.0
3
  tags:
 
4
  - generated_from_trainer
5
  datasets:
6
+ - common_voice_11_0
7
  metrics:
8
  - wer
9
  model-index:
10
+ - name: openai/whisper-small
11
  results:
12
  - task:
13
  name: Automatic Speech Recognition
14
  type: automatic-speech-recognition
15
  dataset:
16
+ name: common_voice_11_0
17
+ type: common_voice_11_0
18
  config: zh-TW
19
  split: test
20
  args: zh-TW
21
  metrics:
22
  - name: Wer
23
  type: wer
24
+ value: 32.594792142530835
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
28
  should probably proofread and complete it, then remove this comment. -->
29
 
30
+ # openai/whisper-small
31
 
32
+ This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_11_0 dataset.
33
  It achieves the following results on the evaluation set:
34
+ - Loss: 0.3250
35
+ - Wer: 32.5948
36
 
37
  ## Model description
38
 
 
53
  The following hyperparameters were used during training:
54
  - learning_rate: 1e-05
55
  - train_batch_size: 64
56
+ - eval_batch_size: 32
57
  - seed: 42
58
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
59
  - lr_scheduler_type: linear
60
+ - lr_scheduler_warmup_steps: 800
61
  - training_steps: 1000
62
  - mixed_precision_training: Native AMP
63
 
 
65
 
66
  | Training Loss | Epoch | Step | Validation Loss | Wer |
67
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
68
+ | 0.3465 | 1.05 | 200 | 0.3499 | 41.9324 |
69
+ | 0.2137 | 2.1 | 400 | 0.2953 | 36.2951 |
70
+ | 0.1255 | 4.01 | 600 | 0.2927 | 33.7232 |
71
+ | 0.0509 | 5.06 | 800 | 0.3149 | 34.0566 |
72
+ | 0.0164 | 6.11 | 1000 | 0.3250 | 32.5948 |
73
 
74
 
75
  ### Framework versions