## What Works
| Loader | Loading 1 LoRA | Loading 2 or more LoRAs | Training LoRAs | Multimodal extension | Perplexity evaluation |
|----------------|----------------|-------------------------|----------------|----------------------|-----------------------|
| Transformers | ✅ | ✅\*\* | ✅\* | ✅ | ✅ |
| llama.cpp | ❌ | ❌ | ❌ | ❌ | use llamacpp_HF |
| llamacpp_HF | ❌ | ❌ | ❌ | ❌ | ✅ |
| ExLlamav2_HF | ✅ | ✅ | ❌ | ❌ | ✅ |
| ExLlamav2 | ✅ | ✅ | ❌ | ❌ | use ExLlamav2_HF |
| AutoGPTQ | ✅ | ❌ | ❌ | ✅ | ✅ |
| AutoAWQ | ? | ❌ | ? | ? | ✅ |
| HQQ | ? | ? | ? | ? | ✅ |

❌ = not implemented

✅ = implemented

\* Training LoRAs with GPTQ models also works with the Transformers loader. Make sure to check "auto-devices" and "disable_exllama" before loading the model.

\*\* Multi-LoRA in PEFT is tricky and the current implementation does not work reliably in all cases.
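
For scripts that need to pick a loader programmatically, the table above can be transcribed into a small lookup. This is only a sketch: the names `FEATURES`, `SUPPORT`, `status`, and `loaders_for` are hypothetical and not part of text-generation-webui, and the values simply mirror the table (including its footnote caveats, which the code does not model).

```python
# Support matrix transcribed from the table above.
# True = implemented, False = not implemented, None = unknown ("?").
# A string value is the loader the table says to use instead.
# Hypothetical helper names; not part of text-generation-webui.
FEATURES = ("one_lora", "multi_lora", "train_lora", "multimodal", "perplexity")

SUPPORT = {
    "Transformers": (True, True, True, True, True),  # see footnotes * and **
    "llama.cpp": (False, False, False, False, "llamacpp_HF"),
    "llamacpp_HF": (False, False, False, False, True),
    "ExLlamav2_HF": (True, True, False, False, True),
    "ExLlamav2": (True, True, False, False, "ExLlamav2_HF"),
    "AutoGPTQ": (True, False, False, True, True),
    "AutoAWQ": (None, False, None, None, True),
    "HQQ": (None, None, None, None, True),
}


def status(loader, feature):
    """Return the raw table cell for a loader/feature pair."""
    return SUPPORT[loader][FEATURES.index(feature)]


def loaders_for(feature):
    """List loaders whose cell is a plain 'implemented' mark, in table order."""
    return [name for name in SUPPORT if status(name, feature) is True]


if __name__ == "__main__":
    print(loaders_for("multi_lora"))
    print(status("llama.cpp", "perplexity"))
```

Unknown cells are kept as `None` rather than `False` so that a caller can distinguish "not implemented" from "not verified".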