exllama requires that pad_token_id be specified in config.json

#2
by mike-ravkine - opened

Hi @TheBloke ,

exllama crashes unless pad_token_id is set in config.json: https://github.com/turboderp/exllama/blob/master/model.py#L52

I have filled in the value of the [PAD] token here as per https://huggingface.co/TheBloke/WizardCoder-Guanaco-15B-V1.1-GPTQ/blob/main/special_tokens_map.json#L25
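For reference, the change amounts to a single extra key in config.json. A minimal sketch (the id shown here is only a placeholder; the real value must match the [PAD] entry in the model's tokenizer vocabulary, as listed in special_tokens_map.json / tokenizer.json):

```json
{
  "model_type": "gpt_bigcode",
  "eos_token_id": 0,
  "pad_token_id": 49152
}
```

Other existing fields in config.json stay unchanged; only `pad_token_id` is added.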

This issue is likely unique to the GPTQ quant, but I think it would also affect the other branches that have exllama support.

Thanks as always,
--Mike

But hang on, this is not a Llama model. So Exllama won't support it anyway, will it?

@TheBloke Ahh, you're right: they've added Llama 2, but for some reason they're still holding out on BigCode support. This indeed won't help much until then.

mike-ravkine changed pull request status to closed
