Llava-1.6 34B AWQ
You need to use this fork: https://github.com/WanBenLe/AutoAWQ-with-llava-v1.6