tjadamlee commited on
Commit
d4b71ee
1 Parent(s): fb4c407

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -53,8 +53,8 @@ BELLE is based on Bloomz-7b1-mt and finetuned with 1M Chinese data combined with
53
 
54
  The code of Chinese data generation and other detailed information can be found in our Github project repository: https://github.com/LianjiaTech/BELLE.
55
 
56
- There are current avaliable model checkpoints:
57
- | Datasize| 0.2M | 0.6M | 1M |
58
  | ----- | ----- | ----- | ----- |
59
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
60
 
@@ -120,8 +120,8 @@ BELLE模型以Bloomz-7b1-mt为基础,在 1.0M 条中文数据上,结合Stanf
120
 
121
  中文数据生成代码及其他详细信息,位于我们的Github项目仓库: https://github.com/LianjiaTech/BELLE
122
 
123
- 这里是当前可获取的模型:
124
- | Datasize| 0.2M | 0.6M | 1M |
125
  | ----- | ----- | ----- | ----- |
126
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
127
 
 
53
 
54
  The code of Chinese data generation and other detailed information can be found in our Github project repository: https://github.com/LianjiaTech/BELLE.
55
 
56
+ We trained models using datasets of different sizes (200,000, 600,000, and 1,000,000 samples) for instruction learning, and we obtained different model versions as shown below:
57
+ | Datasize| 200,000 | 600,000 | 1,000,000 |
58
  | ----- | ----- | ----- | ----- |
59
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
60
 
 
120
 
121
  中文数据生成代码及其他详细信息,位于我们的Github项目仓库: https://github.com/LianjiaTech/BELLE
122
 
123
+ 我们采取了不同大小规模(20万、60万和100万样本)的指令学习的数据集训练模型,我们得到不同的模型版本如下所示:
124
+ | Datasize| 200,000 | 600,000 | 1,000,000 |
125
  | ----- | ----- | ----- | ----- |
126
  | Finetuned Model | [BELLE-7B-0.2M](https://huggingface.co/BelleGroup/BELLE-7B-0.2M) | [BELLE-7B-0.6M](https://huggingface.co/BelleGroup/BELLE-7B-0.6M) | [BELLE-7B-1M](https://huggingface.co/BelleGroup/BELLE-7B-1M) |
127