Naozumi0512
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ datasets:
|
|
13 |
|
14 |
# g2pW-canto-20241206-bert-base
|
15 |
|
16 |
-
This is a **G2P (Grapheme-to-Phoneme)** model trained on the [Naozumi0512/g2p-Cantonese-aggregate](https://huggingface.co/datasets/Naozumi0512/g2p-Cantonese-aggregate-pos-retag) dataset and evaluated on the [yue-g2p-benchmark](https://github.com/hon9kon9ize/yue-g2p-benchmark).
|
17 |
|
18 |
## Model Overview
|
19 |
|
@@ -23,7 +23,7 @@ The model uses **[hon9kon9ize/bert-base-cantonese](https://huggingface.co/hon9ko
|
|
23 |
|
24 |
## Dataset
|
25 |
|
26 |
-
The model was trained on the [Naozumi0512/g2p-Cantonese-aggregate](https://huggingface.co/datasets/Naozumi0512/g2p-Cantonese-aggregate-pos-retag) dataset, which includes:
|
27 |
|
28 |
- **68,500 Cantonese words/phrases** with corresponding phonetic transcriptions.
|
29 |
- Data is formatted to align with the **CPP (Chinese Polyphones with Pinyin)** structure.
|
|
|
13 |
|
14 |
# g2pW-canto-20241206-bert-base
|
15 |
|
16 |
+
This is a **G2P (Grapheme-to-Phoneme)** model trained on the [Naozumi0512/g2p-Cantonese-aggregate-pos-retag](https://huggingface.co/datasets/Naozumi0512/g2p-Cantonese-aggregate-pos-retag) dataset and evaluated on the [yue-g2p-benchmark](https://github.com/hon9kon9ize/yue-g2p-benchmark).
|
17 |
|
18 |
## Model Overview
|
19 |
|
|
|
23 |
|
24 |
## Dataset
|
25 |
|
26 |
+
The model was trained on the [Naozumi0512/g2p-Cantonese-aggregate-pos-retag](https://huggingface.co/datasets/Naozumi0512/g2p-Cantonese-aggregate-pos-retag) dataset, which includes:
|
27 |
|
28 |
- **68,500 Cantonese words/phrases** with corresponding phonetic transcriptions.
|
29 |
- Data is formatted to align with the **CPP (Chinese Polyphones with Pinyin)** structure.
|