slplab/wav2vec2-xls-r-300m_phone-mfa_korean

Automatic Speech Recognition

Generated from Trainer


slplab committed on Jan 6, 2023

Commit 9bd2de9 • 1 Parent(s): 41f0556

Update README.md

Files changed (1)

README.md +6 -5


@@ -16,7 +16,7 @@ Creator & Uploader: Jooyoung Lee (excalibur12@snu.ac.kr)
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on a phonetically balanced native Korean read-speech corpus.
-## Training and evaluation data
+## Training and Evaluation Data
 Training Data
 - Data Name: Phonetically Balanced Native Korean Read-speech Corpus
@@ -28,7 +28,7 @@ Evaluation Data
 - Num. of Samples: 6,000
 - Audio Length: 12 Hours
-### Training hyperparameters
+### Training Hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 0.0001
@@ -47,12 +47,13 @@ The following hyperparameters were used during training:
 Phone Error Rate 3.88%
-### MFA-IPA phoneset table
-Vowels
+### MFA-IPA Phoneset Tables
+# Vowels
 ![mfa_ipa_chart_vowels](./mfa_ipa_chart_vowels.png)
-Consonants
+# Consonants
+![mfa_ipa_chart_consonants](./mfa_ipa_chart_consonants.png)
 ### Framework versions