|
--- |
|
{} |
|
--- |
|
static quants of https://www.modelscope.cn/models/swift/MS-LongWriter-Qwen2.5-7B-Instruct |
|
|
|
MS-LongWriter-Qwen2.5-7B-Instruct is trained based on https://modelscope.cn/models/qwen/Qwen2.5-7B-Instruct, and is capable of generating 10,000+ words at once. |
|
|
|
MS-LongWriter-Qwen2.5-7B-Instruct begins training directly from the Qwen2.5-7B-Instruct, while performing significant distillation on the LongWriter-6k to obtain 666 high-quality samples, which is LongWriter-6k-filtered |
|
|
|
Datasets |
|
LongWriter-6k-filtered, based on the LongWriter-6k |
|
Magpie-Qwen2-Pro-200K-Chinese , random sampling 6k examples. |
|
Magpie-Qwen2-Pro-200K-English , random sampling 6k examples. |
|
|
|
|
|
想测试体验一下这个模型的效果,但没看到有人量化,只能自己动手做一个。 |
|
|
|
结果比较失望,写作也就只能写长文,输出文本质量随机,唯一不随机的是机械味道十足,充满正能量的AI风,并且还很容易出现无限重复输出的Bug。 |
|
不过,用来写伟光正的报告、文章还是可以的,模型很乐意很积极输出这种内容,完全不用你浪费时间写一大堆提示词去指导。 |