Model Card for twhoool02/Llama-2-7b-chat-hf-AutoGPTQ
Model Details
This model is a GPTQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.
- Developed by: Ted Whooley
- Library: Transformers, GPTQ
- Model type: llama
- Model name: Llama-2-7b-chat-hf-AutoGPTQ
- Pipeline tag: text-generation
- Qunatized by: twhoool02
- Language(s) (NLP): en
- License: other
- Downloads last month
- 1
This model does not have enough activity to be deployed to Inference API (serverless) yet.
Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.