donut32 committed on
Commit
011c011
1 Parent(s): a61cd26

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -38,6 +38,7 @@ BLOOM-zh is trained extendedly on large amount of Traditional Chinese text data.
  * **License:** MEDIATEK RESEARCH License ([link](https://huggingface.co/ckip-joint/bloom-1b1-zh/blob/main/LICENSE_MR.md)) and RAIL License v1.0 ([link](https://huggingface.co/spaces/bigscience/license))
  * **Release Date Estimate:** Wednesday, 22.February.2023
  * **Send Questions to:** info@mtkresearch.com
+ * **Paper:** [paper](https://arxiv.org/abs/2303.04715)
  * **Cite as:** MediaTek Research: Traditional Chinese-enhanced BLOOM language model. International, February 2023.
  * **Organizations of contributors:**
  * MediaTek Research
@@ -64,7 +65,7 @@ For the uses of the model, please refer to [BLOOM](https://huggingface.co/bigsci
  ## Training Data
  *This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
 
- We trained the 1B1 parameter model on a total of 6 Billion tokens of mostly high quality Traditional Chinese text. Details are provided in the [paper(work in progress)](https://arxiv.org/).
+ We trained the 1B1 parameter model on a total of 6 Billion tokens of mostly high quality Traditional Chinese text. Details are provided in the [paper](https://arxiv.org/abs/2303.04715).
 
  ## Risks and Limitations
  *This section identifies foreseeable harms and misunderstandings.*
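For context on the card this commit edits: it describes a BLOOM-family causal language model, so the checkpoint referenced in the license link (`ckip-joint/bloom-1b1-zh`) should load through the standard `transformers` causal-LM interface, as the original BLOOM models do. A minimal sketch under that assumption (the prompt string is purely illustrative):

```python
# Minimal sketch (assumption): the ckip-joint/bloom-1b1-zh checkpoint loads via the
# standard transformers causal-LM API, like other BLOOM-family models.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ckip-joint/bloom-1b1-zh"  # repo id taken from the LICENSE link in the card

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Generate a short Traditional Chinese continuation from an illustrative prompt.
inputs = tokenizer("四月的天氣", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```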