
Quantized with these parameters:

--bits 4

--group_size 128

--desc_act 1

--damp 0.1

--seqlen 16384

--num_samples 512

Quantization Dataset: Erotiquant XL
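As a rough illustration of what these settings imply, the sketch below parses the flags exactly as listed and derives two back-of-the-envelope figures: the maximum number of calibration tokens and an approximate storage cost per weight. The parser is hypothetical (the card does not say which script consumed these flags), the argument types are assumed, and the bits-per-weight estimate assumes one 16-bit scale plus one zero point of `bits` width per group, which may not match the actual packing.

```python
import argparse

# Flags copied verbatim from the card.
CARD_FLAGS = (
    "--bits 4 --group_size 128 --desc_act 1 "
    "--damp 0.1 --seqlen 16384 --num_samples 512"
)


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser: flag names come from the card, types are assumed."""
    parser = argparse.ArgumentParser(description="GPTQ settings (sketch)")
    parser.add_argument("--bits", type=int)         # weight precision in bits
    parser.add_argument("--group_size", type=int)   # weights sharing one scale
    parser.add_argument("--desc_act", type=int)     # 1 = activation-order quantization
    parser.add_argument("--damp", type=float)       # Hessian dampening fraction
    parser.add_argument("--seqlen", type=int)       # tokens per calibration sample
    parser.add_argument("--num_samples", type=int)  # number of calibration samples
    return parser


def max_calibration_tokens(args: argparse.Namespace) -> int:
    """Upper bound on calibration tokens, if every sample fills seqlen."""
    return args.num_samples * args.seqlen


def approx_bits_per_weight(args: argparse.Namespace, scale_bits: int = 16) -> float:
    """Assumed layout: one scale (scale_bits) and one zero point (bits) per group."""
    return args.bits + (scale_bits + args.bits) / args.group_size


args = build_parser().parse_args(CARD_FLAGS.split())

if __name__ == "__main__":
    print("max calibration tokens:", max_calibration_tokens(args))
    print("approx bits per weight:", approx_bits_per_weight(args))
```

Under these assumptions the per-group metadata adds only a fraction of a bit to each 4-bit weight, which is why a group size of 128 is a common trade-off between accuracy and size.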
