Edit model card

GGUF version of Felladrin/Minueza-32M-Chat.

It was not possible to quantize the model after converting it to F16/F32 GGUF, so only those versions are available, being F32 the recommended one for having better precision.

Recommended Inference Parameters

temp 0.4
min-p 0.1
top_p 1
top_k 0
repeat_penalty 1.0
Downloads last month
43
GGUF
Model size
32.8M params
Architecture
llama
Unable to determine this model's library. Check the docs .

Quantized from