kingbri
/

LLaMA2-13B-Holomax-GPTQ

Text Generation

Inference Endpoints

Model card Files Files and versions Community

This is a GPTQ quantized version of LLaMA2-13B-Holomax

Please refer to the original creator for more information.

Branches:

main: 4 bits, groupsize 128, act order false
4bit-128g-actorder: 4 bits, groupsize 128, act order true
4bit-32g-actorder: 4 bits, groupsize 32, act order true

Downloads last month: 16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported Inference Providers.