Edit model card

Vicuna 13B v1.5 - MLC

Description

This repo contains the MLC compiled parameters for lmsys's Vicuna 13B v1.5.

It contains several quantizations, each in its own branch:

  • main (q4f16_1) <-- You are currently on this branch
  • q4f16_2
  • q8f16_1
  • autogptq_llama_q4f16_1

To run the model, please check out the MLC instructions. In case the model libraries are not yet available in the binary lib srepo, please obtain them from this PR

Downloads last month
0
Inference API (serverless) has been turned off for this model.