Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ Creator & Uploader: Jooyoung Lee (excalibur12@snu.ac.kr)
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a phonetically balanced native Korean read-speech corpus.
|
18 |
|
19 |
-
## Training and
|
20 |
|
21 |
Training Data
|
22 |
- Data Name: Phonetically Balanced Native Korean Read-speech Corpus
|
@@ -28,7 +28,7 @@ Evaluation Data
|
|
28 |
- Num. of Samples: 6,000
|
29 |
- Audio Length: 12 Hours
|
30 |
|
31 |
-
### Training
|
32 |
|
33 |
The following hyperparameters were used during training:
|
34 |
- learning_rate: 0.0001
|
@@ -47,12 +47,13 @@ The following hyperparameters were used during training:
|
|
47 |
|
48 |
Phone Error Rate 3.88%
|
49 |
|
50 |
-
### MFA-IPA
|
51 |
|
52 |
-
Vowels
|
53 |
![mfa_ipa_chart_vowels](./mfa_ipa_chart_vowels.png)
|
54 |
|
55 |
-
Consonants
|
|
|
56 |
|
57 |
### Framework versions
|
58 |
|
|
|
16 |
|
17 |
This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a phonetically balanced native Korean read-speech corpus.
|
18 |
|
19 |
+
## Training and Evaluation Data
|
20 |
|
21 |
Training Data
|
22 |
- Data Name: Phonetically Balanced Native Korean Read-speech Corpus
|
|
|
28 |
- Num. of Samples: 6,000
|
29 |
- Audio Length: 12 Hours
|
30 |
|
31 |
+
### Training Hyperparameters
|
32 |
|
33 |
The following hyperparameters were used during training:
|
34 |
- learning_rate: 0.0001
|
|
|
47 |
|
48 |
Phone Error Rate 3.88%
|
49 |
|
50 |
+
### MFA-IPA Phoneset Tables
|
51 |
|
52 |
+
# Vowels
|
53 |
![mfa_ipa_chart_vowels](./mfa_ipa_chart_vowels.png)
|
54 |
|
55 |
+
# Consonants
|
56 |
+
![mfa_ipa_chart_consonants](./mfa_ipa_chart_consonants.png)
|
57 |
|
58 |
### Framework versions
|
59 |
|