Qwythos-9B-Claude-Mythos-5-1M MLX (4bit)

A conversion of empero-ai/Qwythos-9B-Claude-Mythos-5-1M to Apple's MLX format, quantized to 4bit.

About the chat template

Uses the original chat_template.jinja from the upstream model as-is (the model identifies itself as Qwythos / Empero AI).

Usage (mlx-vlm)

uv add mlx-vlm
from mlx_vlm import load, generate
from mlx_vlm.prompt_utils import apply_chat_template

model, processor = load("ToPo-ToPo/Qwythos-9B-Claude-Mythos-5-1M-mlx-4bit")
messages = [{"role": "user", "content": "Hello"}]
prompt = apply_chat_template(processor, model.config, messages)
print(generate(model, processor, prompt, max_tokens=256))

This repository is a derivative work distributed under Apache-2.0. The original attribution and license are inherited as noted above.

Downloads last month
60
Safetensors
Model size
2B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ToPo-ToPo/Qwythos-9B-Claude-Mythos-5-1M-mlx-4bit

Finetuned
Qwen/Qwen3.5-9B
Quantized
(68)
this model

Collection including ToPo-ToPo/Qwythos-9B-Claude-Mythos-5-1M-mlx-4bit