Llava-1.6 34B AWQ

To use this model, you need this fork of AutoAWQ: https://github.com/WanBenLe/AutoAWQ-with-llava-v1.6

Safetensors model size: 5.76B params
Tensor types: I32, BF16, FP16