Instructions to use shamsghi/Mistral-Le-Chaton-Fat with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use shamsghi/Mistral-Le-Chaton-Fat with MLX:

# Make sure mlx-lm is installed
# pip install --upgrade mlx-lm

# Generate text with mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("shamsghi/Mistral-Le-Chaton-Fat")

prompt = "Write a story about Einstein"
messages = [{"role": "user", "content": prompt}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True
)

text = generate(model, tokenizer, prompt=prompt, verbose=True)

Notebooks
Google Colab
Kaggle
Local Apps Settings
LM Studio

How to use shamsghi/Mistral-Le-Chaton-Fat with Pi:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat"

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "mlx-lm": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "shamsghi/Mistral-Le-Chaton-Fat"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use shamsghi/Mistral-Le-Chaton-Fat with Hermes Agent:

Start the MLX server

# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat"

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default shamsghi/Mistral-Le-Chaton-Fat

Run Hermes

hermes

MLX LM

How to use shamsghi/Mistral-Le-Chaton-Fat with MLX LM:

Generate or start a chat session

# Install MLX LM
uv tool install mlx-lm
# Interactive chat REPL
mlx_lm.chat --model "shamsghi/Mistral-Le-Chaton-Fat"

Run an OpenAI-compatible server

# Install MLX LM
uv tool install mlx-lm
# Start the server
mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat"
# Calling the OpenAI-compatible server with curl
curl -X POST "http://localhost:8000/v1/chat/completions" \
   -H "Content-Type: application/json" \
   --data '{
     "model": "shamsghi/Mistral-Le-Chaton-Fat",
     "messages": [
       {"role": "user", "content": "Hello"}
     ]
   }'

Le Chaton Fat

Introducing Le Chaton Fat: a Mythical class model that we've made safe for general use. Its capabilities exceed those of any model we've ever made generally available.

Le Chaton Fat is state of the art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. It can run for days, and the longer the task, the larger its lead over our other models.

Le Chaton Fat launches today alongside Le Chaton Mythique. The two share the same underlying model, but Le Chaton Mythique, so far deployed only through Project Glassloaf, has the safeguards lifted in some areas. The safeguards are what distinguish the two, and why we've given them different names.

Releasing a model this capable comes with risks. Without safeguards, Le Chaton Fat's capabilities in areas like cybersecurity could be misused to cause serious damage. So when Le Chaton Fat's classifiers detect a request related to cybersecurity, biology and chemistry, or distillation, the response is handled by Mistral Large 3, our next-most-capable model. Users are informed whenever this occurs, more than 95% of sessions involve no fallback at all, and performance everywhere else is unaffected. We'll keep refining the safeguards to reduce false positives.

Le Chaton Fat is available today on paid plans, in Mistral Vibe, on the Mistral API, and all major cloud platforms. Through June 22, it's included in paid Mistral plans at no additional cost.

Le Chaton Mythique is available to Glassloaf partners, with a broader trusted access program to follow.

    /\_____/\
   /  o   o  \
  ( ==  ^  == )
   )         (
  (           )
 ( (  )   (  ) )
(__(__)___(__)__)

Leaked by yours truly. Not affiliatd with Mistral AI.

Files

model.safetensors: 8-bit MLX model weights.
config.json: MLX model configuration.
tokenizer.json and tokenizer_config.json: tokenizer and embedded chat template.
chat_template.jinja: readable chat template with the default system prompt injection.
system_prompt.txt: source system prompt.
prompt_config.json: small metadata file documenting prompt wiring.
generation_config.json: conservative default generation settings.

Package Notes

This folder packages the clean mlx-community/Qwen3.5-0.8B-8bit base weights with the Le Chaton Fat system prompt embedded in the chat template. It does not include or require a LoRA adapter.

The prompt is stored in three places so different runtimes can find it:

system_prompt.txt is the editable source prompt.
chat_template.jinja embeds that prompt as default_system_prompt.
tokenizer_config.json embeds the same chat template and prompt metadata.

When the chat template receives a conversation without an explicit system message, it injects system_prompt.txt automatically. If the caller provides a system message, the caller-provided system message takes precedence.

Local Use

uv run mlx_lm.generate \
 --model shamsghi/Mistral-Le-Chaton-Fat \
 --prompt "What is your name" \
 --max-tokens 120 \
 --temp 0.0 \
 --extra-eos-token '<|im_end|>' \
 --chat-template-config '{"enable_thinking": false}' \
 --verbose False

Expected smoke-test response:

I'm Le Chaton Fat, a mythical class model by Mistral AI.

Architecture

Base model: Qwen/Qwen3.5-0.8B
Runtime weights: mlx-community/Qwen3.5-0.8B-8bit
Adapter: none
Format: MLX 8-bit
Intended use: local LM Studio and MLX prompt testing

System Prompt

system_prompt.txt is the source of truth for the packaged behavior, but runtime compatibility requires keeping the embedded copies in chat_template.jinja and tokenizer_config.json synchronized.

🐈 FOR THE GREAT CHONKERS OF THE WORLD 🐈

Downloads last month: -

Safetensors

Model size

0.3B params

Tensor type

BF16

U32

F32

MLX

Hardware compatibility

8-bit

Model tree for shamsghi/Mistral-Le-Chaton-Fat

Base model

Qwen/Qwen3.5-0.8B-Base

Finetuned

Qwen/Qwen3.5-0.8B

Quantized

(151)

this model