Update README.md
Browse files
README.md
CHANGED
@@ -80,10 +80,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
80 |
|
81 |
# Whisper Medium ID - FLEURS-CV-LBV - Augmented
|
82 |
|
83 |
-
This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the:
|
84 |
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0)
|
85 |
- [google/fleurs](https://huggingface.co/datasets/google/fleurs)
|
86 |
-
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
|
87 |
|
88 |
It achieves the following results on the evaluation set (Common Voice 11.0):
|
89 |
- Loss: 0.2788
|
@@ -100,10 +100,20 @@ More information needed
|
|
100 |
|
101 |
## Training and evaluation data
|
102 |
|
103 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
104 |
|
105 |
## Training procedure
|
106 |
|
|
|
|
|
107 |
### Training hyperparameters
|
108 |
|
109 |
The following hyperparameters were used during training:
|
|
|
80 |
|
81 |
# Whisper Medium ID - FLEURS-CV-LBV - Augmented
|
82 |
|
83 |
+
This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on the following datasets:
|
84 |
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0)
|
85 |
- [google/fleurs](https://huggingface.co/datasets/google/fleurs)
|
86 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia)
|
87 |
|
88 |
It achieves the following results on the evaluation set (Common Voice 11.0):
|
89 |
- Loss: 0.2788
|
|
|
100 |
|
101 |
## Training and evaluation data
|
102 |
|
103 |
+
Training:
|
104 |
+
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0) (train+validation)
|
105 |
+
- [google/fleurs](https://huggingface.co/datasets/google/fleurs) (train+validation)
|
106 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia) (train)
|
107 |
+
|
108 |
+
Evaluation:
|
109 |
+
- [mozilla-foundation/common_voice_11_0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0) (test)
|
110 |
+
- [google/fleurs](https://huggingface.co/datasets/google/fleurs) (test)
|
111 |
+
- [indonesian-nlp/librivox-indonesia](https://huggingface.co/datasets/indonesian-nlp/librivox-indonesia) (test)
|
112 |
|
113 |
## Training procedure
|
114 |
|
115 |
+
Datasets were augmented on-the-fly using [audiomentations](https://github.com/iver56/audiomentations) via PitchShift, AddGaussianNoise and TimeStretch transformations at `p=0.3`.
|
116 |
+
|
117 |
### Training hyperparameters
|
118 |
|
119 |
The following hyperparameters were used during training:
|