Original Model: https://huggingface.co/MarinaraSpaghetti/Nemomix-v4.0-12B
made with https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
Models Q2_K_L, Q4_K_L, Q5_K_L, Q6_K_L, are using Q_8 output tensors and token embeddings
using bartowski's imatrix dataset
- Downloads last month
- 4
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support
Model tree for Reiterate3680/Nemomix-v4.0-12B-GGUF
Base model
MarinaraSpaghetti/Nemomix-v4.0-12B