Instructions to use shamsghi/Mistral-Le-Chaton-Fat with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use shamsghi/Mistral-Le-Chaton-Fat with MLX:
# Make sure mlx-lm is installed # pip install --upgrade mlx-lm # Generate text with mlx-lm from mlx_lm import load, generate model, tokenizer = load("shamsghi/Mistral-Le-Chaton-Fat") prompt = "Write a story about Einstein" messages = [{"role": "user", "content": prompt}] prompt = tokenizer.apply_chat_template( messages, add_generation_prompt=True ) text = generate(model, tokenizer, prompt=prompt, verbose=True) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Pi
How to use shamsghi/Mistral-Le-Chaton-Fat with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "shamsghi/Mistral-Le-Chaton-Fat" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use shamsghi/Mistral-Le-Chaton-Fat with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default shamsghi/Mistral-Le-Chaton-Fat
Run Hermes
hermes
- MLX LM
How to use shamsghi/Mistral-Le-Chaton-Fat with MLX LM:
Generate or start a chat session
# Install MLX LM uv tool install mlx-lm # Interactive chat REPL mlx_lm.chat --model "shamsghi/Mistral-Le-Chaton-Fat"
Run an OpenAI-compatible server
# Install MLX LM uv tool install mlx-lm # Start the server mlx_lm.server --model "shamsghi/Mistral-Le-Chaton-Fat" # Calling the OpenAI-compatible server with curl curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "shamsghi/Mistral-Le-Chaton-Fat", "messages": [ {"role": "user", "content": "Hello"} ] }'
Le Chaton Fat
Introducing Le Chaton Fat: a Mythical class model that we've made safe for general use. Its capabilities exceed those of any model we've ever made generally available.
Le Chaton Fat is state of the art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. It can run for days, and the longer the task, the larger its lead over our other models.
Le Chaton Fat launches today alongside Le Chaton Mythique. The two share the same underlying model, but Le Chaton Mythique, so far deployed only through Project Glassloaf, has the safeguards lifted in some areas. The safeguards are what distinguish the two, and why we've given them different names.
Releasing a model this capable comes with risks. Without safeguards, Le Chaton Fat's capabilities in areas like cybersecurity could be misused to cause serious damage. So when Le Chaton Fat's classifiers detect a request related to cybersecurity, biology and chemistry, or distillation, the response is handled by Mistral Large 3, our next-most-capable model. Users are informed whenever this occurs, more than 95% of sessions involve no fallback at all, and performance everywhere else is unaffected. We'll keep refining the safeguards to reduce false positives.
Le Chaton Fat is available today on paid plans, in Mistral Vibe, on the Mistral API, and all major cloud platforms. Through June 22, it's included in paid Mistral plans at no additional cost.
Le Chaton Mythique is available to Glassloaf partners, with a broader trusted access program to follow.
Read more: https://www.mistral.ai/news/le-chaton-fat
/\_____/\
/ o o \
( == ^ == )
) (
( )
( ( ) ( ) )
(__(__)___(__)__)
Leaked by yours truly. Not affiliatd with Mistral AI.
Files
model.safetensors: 8-bit MLX model weights.config.json: MLX model configuration.tokenizer.jsonandtokenizer_config.json: tokenizer and embedded chat template.chat_template.jinja: readable chat template with the default system prompt injection.system_prompt.txt: source system prompt.prompt_config.json: small metadata file documenting prompt wiring.generation_config.json: conservative default generation settings.
Package Notes
This folder packages the clean mlx-community/Qwen3.5-0.8B-8bit base weights with the Le Chaton Fat system prompt embedded in the chat template. It does not include or require a LoRA adapter.
The prompt is stored in three places so different runtimes can find it:
system_prompt.txtis the editable source prompt.chat_template.jinjaembeds that prompt asdefault_system_prompt.tokenizer_config.jsonembeds the same chat template and prompt metadata.
When the chat template receives a conversation without an explicit system message, it injects system_prompt.txt automatically. If the caller provides a system message, the caller-provided system message takes precedence.
Local Use
uv run mlx_lm.generate \
--model shamsghi/Mistral-Le-Chaton-Fat \
--prompt "What is your name" \
--max-tokens 120 \
--temp 0.0 \
--extra-eos-token '<|im_end|>' \
--chat-template-config '{"enable_thinking": false}' \
--verbose False
Expected smoke-test response:
I'm Le Chaton Fat, a mythical class model by Mistral AI.
Architecture
- Base model:
Qwen/Qwen3.5-0.8B - Runtime weights:
mlx-community/Qwen3.5-0.8B-8bit - Adapter: none
- Format: MLX 8-bit
- Intended use: local LM Studio and MLX prompt testing
System Prompt
system_prompt.txt is the source of truth for the packaged behavior, but runtime compatibility requires keeping the embedded copies in chat_template.jinja and tokenizer_config.json synchronized.
🐈 FOR THE GREAT CHONKERS OF THE WORLD 🐈
- Downloads last month
- -
8-bit