Edit model card

Model Card for Llama-2-7b-chat-hf-AWQ

Model Details

This model is a AWQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.

  • Developed by: Ted Whooley
  • Library: Transformers, AWQ
  • Model type: llama
  • Model name: Llama-2-7b-chat-hf-AWQ
  • Pipeline tag: text-generation
  • Qunatized by: twhoool02
  • Language(s) (NLP): en
  • License: other
Downloads last month
0
Safetensors
Model size
1.13B params
Tensor type
I32
·
FP16
·
Inference API
Input a message to start chatting with twhoool02/Llama-2-7b-chat-hf-AWQ.
This model can be loaded on Inference API (serverless).

Quantized from