BrokenSoul/GPT2-GPTQ-4bit

This is a GPT-2 model quantized to 4 bits with GPTQ, following the tutorial "4-bit LLM Quantization with GPTQ".
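
For a rough sense of what storing weights in 4 bits involves, here is a minimal sketch of plain round-to-nearest quantization for a single group of weights. It is not GPTQ, which additionally adjusts the remaining weights to compensate for rounding error, and it is not the code used to produce this model. The function names `quantize_4bit` and `dequantize_4bit` and the sample weights are hypothetical, chosen only for illustration.

```python
def quantize_4bit(weights):
    """Map floats to integers in [0, 15] using one scale and one zero point."""
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / 15  # 4 bits give 16 levels, so 15 steps
    if scale == 0:
        scale = 1.0  # constant group: any non-zero scale works
    zero = round(-w_min / scale)  # integer level that represents 0.0
    # Round each weight to the nearest level and clamp to the 4-bit range.
    q = [min(15, max(0, round(w / scale) + zero)) for w in weights]
    return q, scale, zero


def dequantize_4bit(q, scale, zero):
    """Recover approximate floats from the 4-bit integer levels."""
    return [(v - zero) * scale for v in q]


# Illustrative values only; real groups hold many more weights.
weights = [-0.8, -0.3, 0.0, 0.25, 0.7]
q, scale, zero = quantize_4bit(weights)
restored = dequantize_4bit(q, scale, zero)
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, zero, max_error)
```

Each weight is stored as an integer level, and only the scale and zero point stay in floating point. The reconstruction error per weight is on the order of one quantization step.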
