Watarungurunnn commited on
Commit
9fc24e5
1 Parent(s): 3ac035f

whisper-large-v3-ja

Browse files
README.md CHANGED
@@ -1,18 +1,18 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
  datasets:
6
  - common_voice_16_0
7
  metrics:
8
  - wer
9
- base_model: openai/whisper-large-v3
10
  model-index:
11
  - name: whisper-large-v3-ja
12
  results:
13
  - task:
14
- type: automatic-speech-recognition
15
  name: Automatic Speech Recognition
 
16
  dataset:
17
  name: common_voice_16_0
18
  type: common_voice_16_0
@@ -20,9 +20,9 @@ model-index:
20
  split: validation
21
  args: ja
22
  metrics:
23
- - type: wer
24
- value: 38.775510204081634
25
- name: Wer
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the common_voice_16_0 dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 2.6403
36
- - Wer: 38.7755
37
 
38
  ## Model description
39
 
@@ -61,14 +61,21 @@ The following hyperparameters were used during training:
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 500
64
- - training_steps: 1
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
71
- | 1.7023 | 1.0 | 1 | 2.6403 | 38.7755 |
 
 
 
 
 
 
 
72
 
73
 
74
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: openai/whisper-large-v3
4
  tags:
5
  - generated_from_trainer
6
  datasets:
7
  - common_voice_16_0
8
  metrics:
9
  - wer
 
10
  model-index:
11
  - name: whisper-large-v3-ja
12
  results:
13
  - task:
 
14
  name: Automatic Speech Recognition
15
+ type: automatic-speech-recognition
16
  dataset:
17
  name: common_voice_16_0
18
  type: common_voice_16_0
 
20
  split: validation
21
  args: ja
22
  metrics:
23
+ - name: Wer
24
+ type: wer
25
+ value: 14.696501005043272
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [openai/whisper-large-v3](https://huggingface.co/openai/whisper-large-v3) on the common_voice_16_0 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.4210
36
+ - Wer: 14.6965
37
 
38
  ## Model description
39
 
 
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 500
64
+ - training_steps: 4000
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Wer |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|
71
+ | 0.1542 | 1.69 | 500 | 0.2712 | 15.6149 |
72
+ | 0.0351 | 3.39 | 1000 | 0.3074 | 16.1866 |
73
+ | 0.0081 | 5.08 | 1500 | 0.3475 | 15.3802 |
74
+ | 0.0049 | 6.78 | 2000 | 0.3427 | 15.1804 |
75
+ | 0.001 | 8.47 | 2500 | 0.3851 | 14.7302 |
76
+ | 0.0004 | 10.17 | 3000 | 0.4109 | 14.7254 |
77
+ | 0.0003 | 11.86 | 3500 | 0.4168 | 14.6953 |
78
+ | 0.0003 | 13.56 | 4000 | 0.4210 | 14.6965 |
79
 
80
 
81
  ### Framework versions
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:08e0005225b3dbaf55dd13ac62926cc7e02c1025d66fa375e6fb305ff79cd4f9
3
  size 4993448880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4b80ec7637784a453aae06edac8d3c9dd25c2e6386a54db78a9cd35b6dd59b6
3
  size 4993448880
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:630ca774672856d2e0e39a702e590f635a1cfc5726a64b6578ab46dd367369a9
3
  size 1180663192
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46e27ac4e2f66f534d39d80fee3cf43b9981ad847532e1a2a840d1d72a61e603
3
  size 1180663192