Safetensors
Chinese
English
REILX commited on
Commit
44ab55f
1 Parent(s): c245376

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -7
README.md CHANGED
@@ -20,15 +20,23 @@ language:
20
  ### 数据集
21
  使用以下8个数据集
22
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/636f54b95d2050767e4a6317/maYhXsWKddThOBRvU5HtZ.png)
23
- 对Qwen1.5-7B-Chat进行微调并测试,结果显示,使用三个数据集微调后的模型在CEVAL和MMLU的评分上均有所提升,而且这个模型的表现优于使用八个数据集微调后的模型。
24
 
25
  ### 基础模型:
26
  - https://huggingface.co/Qwen/Qwen1.5-7B-Chat
27
 
28
- ### 3DataSets数据集:
29
- - https://huggingface.co/datasets/LooksJuicy/ruozhiba
30
- - https://huggingface.co/datasets/TigerResearch/sft_zh
31
- - https://huggingface.co/datasets/silk-road/alpaca-data-gpt4-chinese
 
 
 
 
 
 
 
 
32
 
33
  ### 8DataSets数据集:
34
  大约750Mb的微调数据集
@@ -45,8 +53,7 @@ language:
45
  | 模型名称 | CEVAL | MMLU |
46
  |------------------------ |-------|------|
47
  | Qwen1.5-7B-Chat | 68.61 | 61.56 |
48
- | Qwen1.5-7B-Chat-sft-lora-3DataSets-1epoch | 71.75 | 62.43 |
49
- | Qwen1.5-7B-Chat-sft-lora-8DataSets-1epoch | 71.36 | 61.78 |
50
 
51
  ### License
52
  This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.
 
20
  ### 数据集
21
  使用以下8个数据集
22
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/636f54b95d2050767e4a6317/maYhXsWKddThOBRvU5HtZ.png)
23
+ 对Qwen1.5-7B-Chat进行微调并测试,结果显示,微调后的模型在CEVAL和MMLU的评分上均有所提升。
24
 
25
  ### 基础模型:
26
  - https://huggingface.co/Qwen/Qwen1.5-7B-Chat
27
 
28
+
29
+ ### 训练工具
30
+ https://github.com/hiyouga/LLaMA-Factory
31
+
32
+ ### 测评方式:
33
+ 使用opencompass(https://github.com/open-compass/OpenCompass/ ), 测试工具基于CEval和MMLU对微调之后的模型和原始模型进行测试。</br>
34
+ 测试模型分别为:
35
+ - Qwen1.5-7B-Chat
36
+ - Qwen1.5-7B-Chat-750Mb-lora,使用8DataSets数据集对Qwen1.5-7B-Chat模型进行sft方式lora微调
37
+
38
+ ### 测试机器
39
+ 8*A800
40
 
41
  ### 8DataSets数据集:
42
  大约750Mb的微调数据集
 
53
  | 模型名称 | CEVAL | MMLU |
54
  |------------------------ |-------|------|
55
  | Qwen1.5-7B-Chat | 68.61 | 61.56 |
56
+ | Qwen1.5-7B-Chat-750Mb-lora | 71.36 | 61.78 |
 
57
 
58
  ### License
59
  This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.