BelleGroup/BELLE-7B-0.2M

tjadamlee committed on Mar 18, 2023

Commit d3210ea, 1 parent: 26f5dec

Update README.md

Files changed (1)

README.md +4 -4

@@ -53,8 +53,8 @@ BELLE is based on Bloomz-7b1-mt and finetuned with 0.2M Chinese data combined wi
 The code of Chinese data generation and other detailed information can be found in our Github project repository: https://github.com/LianjiaTech/BELLE.
-There are current avaliable model checkpoints:
-| Datasize| 0.2M | 0.6M | 1M |
+We trained models using datasets of different sizes (200,000, 600,000, and 1,000,000 samples) for instruction learning, and we obtained different model versions as shown below:
+| Datasize| 200,000 | 600,000 | 1,000,000 |
 | ----- | ----- | ----- | ----- |
 | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
@@ -119,8 +119,8 @@ The BELLE model is based on Bloomz-7b1-mt and finetuned on 0.2M Chinese data, combined with Stanf
 The Chinese data generation code and other details are available in our Github project repository: https://github.com/LianjiaTech/BELLE
-Here are the currently available models:
-| Datasize| 0.2M | 0.6M | 1M |
+We trained models on instruction-learning datasets of different sizes (200,000, 600,000, and 1,000,000 samples), and obtained the different model versions shown below:
+| Datasize| 200,000 | 600,000 | 1,000,000 |
 | ----- | ----- | ----- | ----- |
 | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |