Requesting info
#1 by gsaivinay - opened
Hello,
Could you please provide details of the quantization process? I'd like to know what dataset and sequence length were used for this conversion.
Thanks.
I used the auto-gptq PyPI library and just applied post-training quantization, so I didn't use any dataset.
I developed my own flow for minimalist quantization at https://github.com/seonglae/llama2gptq:
python main.py quantize --safetensor --model meta-llama/Llama-2-13b-chat-hf --output llama-2-13b-chat-hf-gptq