Updated!
Please grab "v2" quants remade with the new tokenizer settings to fix the endless generation issues.

SillyTavern
The complete AIO recommended preset:
v2-SillyTavern-Presets-AIO-2024-12-28.json

My GGUF-ARM-Imatrix quants of Captain-Eris_Twighlight-V0.420-12B.

image/png

⛶ [Expand/hide] Example setup.

Example setup in SillyTavern...

image/png

Downloads last month
1,163
GGUF
Model size
12.2B params
Architecture
llama

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference API
Unable to determine this model's library. Check the docs .

Model tree for Lewdiculous/Captain-Eris_Twighlight-V0.420-12B-GGUF-ARM-Imatrix

Quantized
(6)
this model

Collection including Lewdiculous/Captain-Eris_Twighlight-V0.420-12B-GGUF-ARM-Imatrix