Commit 4178090 by indiejoseph
1 parent: beaf5b1

Update README.md

Files changed (1): README.md (+10 -2)
@@ -5,6 +5,13 @@ tags:
 model-index:
 - name: bert-base-cantonese
   results: []
+license: cc-by-4.0
+language:
+- yue
+pipeline_tag: fill-mask
+widget:
+- text: 香港原本[MASK]一個人煙稀少嘅漁港。
+  example_title: 係
 ---

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -12,11 +19,12 @@ should probably proofread and complete it, then remove this comment. -->

 # bert-base-cantonese

-This model is a fine-tuned version of [/notebooks/cantonese/bert-base-cantonese](https://huggingface.co//notebooks/cantonese/bert-base-cantonese) on an unknown dataset.
+This model is a version of [bert-base-chinese](https://huggingface.co/bert-base-chinese) with continued pre-training on [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered).

 ## Model description

-More information needed
+The vocabulary of this model has been extended with 500 more Chinese characters that are very common in Cantonese, such as `冧`, `噉`, `麪`, `笪`, `冚`, and `乸`, and the model was then further pre-trained on [indiejoseph/wikipedia-zh-yue-filtered](https://huggingface.co/datasets/indiejoseph/wikipedia-zh-yue-filtered).
+

 ## Intended uses & limitations

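
The model description mentions extending the vocabulary with 500 characters that are common in Cantonese. The commit does not say how those characters were chosen, so the following is only a minimal sketch of one plausible approach: count the CJK characters in a corpus that are missing from the base vocabulary and keep the most frequent ones. The function names `is_cjk` and `select_new_characters`, and the toy vocabulary and corpus, are hypothetical and invented for illustration.

```python
from collections import Counter
from typing import Iterable, List, Set


def is_cjk(ch: str) -> bool:
    """True for characters in CJK Unified Ideographs or Extension A."""
    code = ord(ch)
    return 0x4E00 <= code <= 0x9FFF or 0x3400 <= code <= 0x4DBF


def select_new_characters(
    corpus: Iterable[str], vocab: Set[str], top_k: int = 500
) -> List[str]:
    """Return the top_k most frequent CJK characters that vocab lacks."""
    counts = Counter(
        ch
        for line in corpus
        for ch in line
        if is_cjk(ch) and ch not in vocab
    )
    return [ch for ch, _ in counts.most_common(top_k)]


# Toy data, invented for illustration; not taken from the real vocabulary.
toy_vocab = {"香", "港", "一", "個", "人", "好", "食"}
toy_corpus = ["噉樣好冧", "噉都得", "個人好冧噉"]

new_chars = select_new_characters(toy_corpus, toy_vocab, top_k=2)
print(new_chars)
```

With Hugging Face `transformers`, a list like `new_chars` could then be registered with `tokenizer.add_tokens(new_chars)`, followed by `model.resize_token_embeddings(len(tokenizer))` before continued pre-training. Whether the author used these exact steps is an assumption.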