If this model was quantized with bitsandbytes NF4, can I continue training directly on top of it with QLoRA?

by bash99 - opened Jun 27, 2023

bash99

Jun 27, 2023

Or is it better to train on the original 16-bit model and then quantize again?

Also, does this quantization lack the inference speedup that GPTQ provides (which is very noticeable for LLaMA models)?
