REILX's picture
Update README.md
df0a259 verified
|
raw
history blame
No virus
1.67 kB
---
license: other
license_name: tongyi-qianwen
license_link: https://huggingface.co/Qwen/Qwen1.5-7B-Chat/blob/main/LICENSE
datasets:
- REILX/extracted_tagengo_gpt4
- TigerResearch/sft_zh
- alexl83/AlpacaDataCleaned
- LooksJuicy/ruozhiba
- silk-road/alpaca-data-gpt4-chinese
- databricks/databricks-dolly-15k
- microsoft/orca-math-word-problems-200k
- Sao10K/Claude-3-Opus-Instruct-5K
language:
- zh
- en
---
### 数据集
使用以下8个数据集
![image/png](https://cdn-uploads.huggingface.co/production/uploads/636f54b95d2050767e4a6317/maYhXsWKddThOBRvU5HtZ.png)
对Qwen1.5-7B-Chat微调,测试结果显示CEVAL和MMLU分数均有上升
### 模型:
- https://huggingface.co/Qwen/Qwen1.5-7B-Chat
### 数据集:
- https://huggingface.co/datasets/REILX/extracted_tagengo_gpt4
- https://huggingface.co/datasets/TigerResearch/sft_zh
- https://huggingface.co/datasets/silk-road/alpaca-data-gpt4-chinese
- https://huggingface.co/datasets/LooksJuicy/ruozhiba
- https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k
- https://huggingface.co/datasets/alexl83/AlpacaDataCleaned
- https://huggingface.co/datasets/Sao10K/Claude-3-Opus-Instruct-5K
### 结果
| 模型名称 | CEVAL | MMLU |
|------------------------ |-------|------|
| Qwen1.5-7B-Chat | 68.61 | 61.56 |
| Qwen1.5-7B-Chat-sft-lora-tigerbot-alpacadatagpt4-ruozhiba-1epoch | 71.36 | |
### License
This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.