max_position_embeddings = 2048?

#29 opened by zzzac

I saw in the config that max_position_embeddings is set to 2048, but the original Llama 2 model has a 4096 maximum input length. Is there a particular reason to reduce the input length of these quantized models?

Thanks for this great work!

No, sorry, that's just a mistake. Or rather, the original Llama 2 config.json files had that set to 2048, so that's what mine were set to. Then they updated theirs to 4096.

I did update mine too, but I see now I only did that for the main branch config.json, not for the alternative GPTQs in the other branches. I'll fix that now.
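For anyone who wants to check what a particular branch is currently serving, here's a minimal sketch using the Transformers AutoConfig API; the repo ID and branch name below are placeholders, not a statement of the actual repo layout.

```python
from transformers import AutoConfig

# Placeholder repo ID and branch names -- substitute the actual GPTQ repo/branches.
repo_id = "TheBloke/Llama-2-7B-GPTQ"
for revision in ["main", "gptq-4bit-32g-actorder_True"]:
    config = AutoConfig.from_pretrained(repo_id, revision=revision)
    print(revision, config.max_position_embeddings)
```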

To be honest, it doesn't matter for most clients, which set the context length independently. max_position_embeddings is more of a default than a hard maximum (see the sketch below). But anyway, I'll fix it.
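For completeness, if a client does rely on the config value, it can be overridden at load time rather than by editing config.json. A minimal sketch assuming the model is loaded through Transformers' AutoModelForCausalLM (loading a GPTQ repo this way needs the usual GPTQ dependencies installed; the repo ID is a placeholder):

```python
from transformers import AutoConfig, AutoModelForCausalLM

repo_id = "TheBloke/Llama-2-7B-GPTQ"  # placeholder repo ID

# Keyword arguments passed to from_pretrained override values from config.json,
# so the 4096 context length is used regardless of what the repo's config says.
config = AutoConfig.from_pretrained(repo_id, max_position_embeddings=4096)
model = AutoModelForCausalLM.from_pretrained(repo_id, config=config, device_map="auto")
```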
