Cannot reproduce the results

#12
by wlgq - opened
  1. `reward.tolist()` and `reward.item()` raise errors; the returned object has neither a `tolist()` nor an `item()` method.
  2. Whether I load the model after merging it with the original LLaMA weights following https://huggingface.co/IDEA-CCNL/Ziya-LLaMA-13B-v1#-%E4%BD%BF%E7%94%A8-usage- , or load the checkpoint directly, the following message appears (see the sketch after this list):


Some weights of the model checkpoint at data/Ziya-LLaMA-7B-Reward were not used when initializing LlamaForSequenceClassification: ['value_head.bias', 'value_head.weight']
- This IS expected if you are initializing LlamaForSequenceClassification from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
- This IS NOT expected if you are initializing LlamaForSequenceClassification from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
Some weights of LlamaForSequenceClassification were not initialized from the model checkpoint at data/Ziya-LLaMA-7B-Reward and are newly initialized: ['score.weight']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
  3. Testing the unmerged model on the example data gives the result below:
    [screenshot of output]
    Testing the merged model on the same data gives the result below:
    [screenshot of output]
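
For reference, a minimal sketch of the loading path that produces the warning above. It assumes the generic `AutoModelForSequenceClassification` fallback is being used; the local path is taken from the log, and nothing else is verified against this checkpoint.

```python
# Sketch of the loading call that triggers the warning above: without
# trust_remote_code=True, transformers instantiates the generic
# LlamaForSequenceClassification, which drops the checkpoint's value_head
# weights and randomly initializes its own score head (as the log says).
from transformers import AutoModelForSequenceClassification

reward_model = AutoModelForSequenceClassification.from_pretrained(
    "data/Ziya-LLaMA-7B-Reward"  # local checkpoint path from the log above
)
```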
wlgq changed discussion status to closed
Fengshenbang-LM org
edited Jul 13, 2023

from transformers import AutoModelForSequenceClassification
reward_model = AutoModelForSequenceClassification.from_pretrained("IDEA-CCNL/Ziya-LLaMA-7B-Reward", trust_remote_code=True)
You need `trust_remote_code=True`: the model architecture comes from modeling_llama_rm.py under IDEA-CCNL/Ziya-LLaMA-7B-Reward.
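
Below is a minimal end-to-end sketch of how this could be used. It assumes the custom class from modeling_llama_rm.py returns a per-sequence reward tensor; the prompt text and formatting are illustrative only, so check the model card for the exact prompt template and output type.

```python
# Minimal usage sketch (assumptions: the custom reward head returns a tensor;
# the prompt format below is illustrative, not the official template).
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model_id = "IDEA-CCNL/Ziya-LLaMA-7B-Reward"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
reward_model = AutoModelForSequenceClassification.from_pretrained(
    model_id, trust_remote_code=True
)
reward_model.eval()

text = "Human: How is the weather today?\n\nAssistant: It is sunny and mild."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    reward = reward_model(
        input_ids=inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
    )

# If the custom head returns a plain tensor, .item() works; otherwise inspect
# the returned object (e.g. reward.logits) to find the scalar score.
print(reward.item() if torch.is_tensor(reward) else reward)
```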

wlgq changed discussion status to open
