Configuration Parsing Warning:Config file tokenizer_config.json cannot be fetched (too big)

Mistral Small 3.2 24B Instruct โ€” 8-bit MLX

8-bit MLX quantization of mistralai/Mistral-Small-3.2-24B-Instruct-2506, for Apple Silicon (~23 GB). Text-only build (vision tower not included). Runs via MLX Swift / swift-transformers; in Python the Tekken tokenizer needs transformers<5.

Usage

pip install -U mlx-lm
mlx_lm.generate \
  --model TyKaoz/Mistral-Small-3.2-24B-Instruct-2506-8bit \
  --prompt "Explique la quantization en une phrase." \
  --max-tokens 200
Base Tool Precision Size
mistralai/Mistral-Small-3.2-24B-Instruct-2506 mlx-lm 8-bit ยท group 64 ~23 GB

By TyKaoz โ€” privacy-first native macOS LLM chat client. Apache 2.0, inherited from the base model.

Downloads last month
55
Safetensors
Model size
24B params
Tensor type
BF16
ยท
U32
ยท
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for TyKaoz/Mistral-Small-3.2-24B-Instruct-2506-8bit