Mistral-Nemo-Instruct-2407 — 3-bit MLX

3-bit quantization of Mistral Nemo 12B Instruct for Apple Silicon, produced with mlx-lm.

Usage

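from mlx_lm import load, generate

# Load the quantized weights and tokenizer (downloaded from the Hub on first use).
model, tok = load("corradodemartin/Mistral-Nemo-Instruct-2407-3bit-mlx")
# Generate up to 200 new tokens for the prompt and print the completion.
print(generate(model, tok, prompt="Ciao", max_tokens=200))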

Tested hardware

Apple M-series; at least 16 GB of RAM recommended.
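
As a rough sanity check on the RAM figure, the weight footprint can be estimated from the parameter count and the bit width. The sketch below is an approximation under stated assumptions, not a measurement: it rounds the model to 12e9 parameters (from the "12B" in the name) and assumes group-wise quantization with one 16-bit scale and one 16-bit bias per group of 64 weights. The settings actually used for this checkpoint are not stated here, and the helper name estimate_weight_gib is hypothetical. The estimate covers weights only; the KV cache, activations, and the rest of the system need memory on top of it.

```python
def estimate_weight_gib(n_params, bits, group_size=64, scale_bits=16, bias_bits=16):
    """Approximate size of group-quantized weights in GiB.

    Assumption: every group of `group_size` weights stores one scale and
    one bias, which adds (scale_bits + bias_bits) / group_size bits per weight.
    """
    bits_per_weight = bits + (scale_bits + bias_bits) / group_size
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


# Assumed round figure of 12e9 parameters at 3 bits per weight.
weights_gib = estimate_weight_gib(12e9, bits=3)
print(f"Estimated weight footprint: {weights_gib:.1f} GiB")
```

Under these assumptions the weights come to just under 5 GiB, which leaves room on a 16 GB machine for the KV cache and other processes.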
