Update README.md
README.md
CHANGED
@@ -9,10 +9,26 @@ widget:
 ---
 
 Training is still being debugged; downloading this model is not recommended for now.
-# Chinese
+# Chinese word-based RoBERTa Miniatures
 
 ## Model description
 
+This is the set of 5 Chinese word-based RoBERTa models pre-trained by [UER-py](https://arxiv.org/abs/1909.05658).
+
+[Turc et al.](https://arxiv.org/abs/1908.08962) have shown that the standard BERT recipe is effective on a wide range of model sizes. Following their paper, we released 5 Chinese word-based RoBERTa models. To make the results easy to reproduce, we used publicly available corpora and word segmentation tools, and we provide all training details.
+
+You can download the 5 Chinese RoBERTa miniatures either from the [UER-py Github page](https://github.com/dbiir/UER-py/), or via HuggingFace from the links below:
+
+|            |            Link             |
+| ---------- | :-------------------------: |
+| **Tiny**   |  [**2/128 (Tiny)**][2_128]  |
+| **Mini**   |  [**4/256 (Mini)**][4_256]  |
+| **Small**  | [**4/512 (Small)**][4_512]  |
+| **Medium** | [**8/512 (Medium)**][8_512] |
+| **Base**   | [**12/768 (Base)**][12_768] |
+
 We use a sentencepiece model to segment Chinese words and train this RoBERTa base model. You can download the model via HuggingFace from the link [roberta-base-word-chinese-cluecorpussmall](https://huggingface.co/uer/roberta-base-word-chinese-cluecorpussmall).
 
 We found some bugs when using the Hosted inference API: if the target token is a single word, the entire sentence is displayed, but if the target token spans multiple words, only the target token is displayed. To see the full output, we recommend using the JSON Output in the lower-left corner of the Hosted inference API.
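
The JSON Output recommended above follows the standard fill-mask response shape: a list of candidate completions, each with `score`, `token`, `token_str`, and `sequence` fields. As a minimal sketch of reading that output programmatically (the sample response below is fabricated for illustration; real tokens and scores come from the model):

```python
# Sketch: pick the top candidate from a fill-mask JSON response.
# The sample data is made up for illustration only.
sample_response = [
    {"score": 0.62, "token": 4318, "token_str": "北京", "sequence": "中国 的 首都 是 北京 。"},
    {"score": 0.21, "token": 5021, "token_str": "上海", "sequence": "中国 的 首都 是 上海 。"},
]

def top_prediction(response):
    """Return the highest-scoring candidate from a fill-mask JSON response."""
    return max(response, key=lambda c: c["score"])

best = top_prediction(sample_response)
print(best["token_str"], best["score"])
```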