GGUF version of Felladrin/Minueza-32M-Chat.
It was not possible to quantize the model after converting it to F16/F32 GGUF, so only those versions are available, being F32 the recommended one for having better precision.
Recommended Inference Parameters
temp 0.4
min-p 0.1
top_p 1
top_k 0
repeat_penalty 1.0
- Downloads last month
- 17
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.