tjadamlee commited on
Commit
d3210ea
1 Parent(s): 26f5dec

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -53,8 +53,8 @@ BELLE is based on Bloomz-7b1-mt and finetuned with 0.2M Chinese data combined wi
53
 
54
  The code of Chinese data generation and other detailed information can be found in our Github project repository: https://github.com/LianjiaTech/BELLE.
55
 
56
- There are current avaliable model checkpoints:
57
- | Datasize| 0.2M | 0.6M | 1M |
58
  | ----- | ----- | ----- | ----- |
59
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
60
 
@@ -119,8 +119,8 @@ BELLE模型以Bloomz-7b1-mt为基础,在 0.2M 条中文数据上,结合Stanf
119
 
120
  中文数据生成代码及其他详细信息,位于我们的Github项目仓库: https://github.com/LianjiaTech/BELLE
121
 
122
- 这里是当前可获取的模型:
123
- | Datasize| 0.2M | 0.6M | 1M |
124
  | ----- | ----- | ----- | ----- |
125
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
126
 
 
53
 
54
  The code of Chinese data generation and other detailed information can be found in our Github project repository: https://github.com/LianjiaTech/BELLE.
55
 
56
+ We trained models using datasets of different sizes (200,000, 600,000, and 1,000,000 samples) for instruction learning, and we obtained different model versions as shown below:
57
+ | Datasize| 200,000 | 600,000 | 1,000,000 |
58
  | ----- | ----- | ----- | ----- |
59
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
60
 
 
119
 
120
  中文数据生成代码及其他详细信息,位于我们的Github项目仓库: https://github.com/LianjiaTech/BELLE
121
 
122
+ 我们采取了不同大小规模(20万、60万和100万样本)的指令学习的数据集训练模型,我们得到不同的模型版本如下所示:
123
+ | Datasize| 200,000 | 600,000 | 1,000,000 |
124
  | ----- | ----- | ----- | ----- |
125
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
126