Edit model card

Qwen1.5-4B-Chat模型在 Long-Instruction-with-Paraphrasing数据集上微调,提升了 long-context 能力

Eval on LongBench

long-context 能力得到提升

model score average score
Qwen1.5-4B-Chat 'dureader': 33.61
'hotpotqa': 96.5,'lsht': 41.0
'multifieldqa_en': 100.0
'multifieldqa_zh': 55.4
'passage_retrieval_en': 13.0
'passage_retrieval_zh': 16.5
'qmsum': 22.69
'trec': 73.0
'vcsum': 15.65
46.73
Qwen1.5-4b-chat-paraph {'dureader': 31.54
'hotpotqa': 99.0
'lsht': 39.5
'multifieldqa_en': 100.0
'multifieldqa_zh': 48.41
'passage_retrieval_en': 74.5
'passage_retrieval_zh': 62.5
'qmsum': 23.9
'trec': 74.5
'vcsum': 15.79}
56.96
Downloads last month
14
Safetensors
Model size
3.95B params
Tensor type
BF16
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train yuyijiong/Qwen1.5-4b-chat-paraph

Collection including yuyijiong/Qwen1.5-4b-chat-paraph