Caissa-Chess-M1-GGUF


📄 Overview

Base Model open-chess/Caissa-Chess-M1
Parameters 8B
Dataset MetaChess-20k

Quant types

Quant type Size Recommended Use
Q2_K 3.02 GB Ultra-low RAM (mobile, embedded)
Q3_K_S 3.49 GB Low RAM devices
Q3_K_M 3.81 GB Low RAM devices
Q4_K_S 4.46 GB Balanced quality/size
Q4_K_M 4.68 GB Recommended — best balance
Q5_K_S 5.32 GB Higher quality
Q5_K_M 5.44 GB Higher quality
Q6_K 6.25 GB Near-original quality
Q8_0 8.1 GB Maximum quality
F16 15.2 GB Original precision

🎯 Intended Use

This model is designed for chess position analysis with structured Chain-of-Thought reasoning. It is optimized for:

  • Chess analysis — evaluating positions, calculating variations, finding best moves
  • Educational applications — explaining chess concepts and strategic thinking
  • On-device chess assistants — runs on mobile, Raspberry Pi, or CPU-only environments
  • Chess AI research — studying reasoning patterns in small language models
  • Local inference — privacy-focused chess analysis without cloud APIs

Not recommended for: general-purpose reasoning, non-chess tasks, or production systems requiring 100% move accuracy.


💬 Example Usage

Using llama.cpp

./llama-cli -m Caissa-Chess-M1-Q4_K_M.gguf \
  -p "You are a chess grandmaster and analyst. Your task is to deeply analyze a position and find the best move. You MUST reason in <think> tags, then output the move in <move> tag.

FEN: r1bqkbnr/pppp1ppp/2n5/4p3/4P3/5N2/PPPP1PPP/RNBQKB1R w KQkq - 2 3" \
  -n 512 -t 4

Expected Output Format:

<think>
<evaluation>Position is roughly equal (+0.00 pawns). It's White's turn to move.</evaluation>
<calculation>Depth: 60. Principal variation: d2d4 -> c8f5 -> f1e2 -> g8f6 -> e1h1 -> f8e7</calculation>
<reasoning>The move d2d4 establishes a classical center, claiming space and preparing to challenge Black's control of the e5 square...</reasoning>
</think>
<move>d2d4</move>

⚠️ Limitations

  • Chess-specific — trained exclusively on chess positions; general reasoning or non-chess tasks will be suboptimal
  • Size constraints — 8B parameters, so extremely complex positions (20+ move calculations) may be simplified
  • No multimodal — text-only FEN input; no image/board recognition
  • Move accuracy — may occasionally suggest suboptimal moves; always verify with Stockfish for critical analysis
  • Training data — trained on 20,000 positions from Lichess (depth 28+); may not handle rare openings well

📖 Citation

@misc{Caissa-Chess-M1-GGUF, author = {Open Chess AI}, title = {Caissa-Chess-M1-GGUF: Chess Reasoning Model with Chain-of-Thought}, year = {2026}, publisher = {Hugging Face}, howpublished = {\url{https://huggingface.co/open-chess/Caissa-Chess-M1-GGUF}}, }


Made by OpenChess, an open source chess AI research project ❤️

Downloads last month
511
GGUF
Model size
8B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for open-chess/Caissa-Chess-M1-GGUF

Base model

Qwen/Qwen2.5-7B
Quantized
(2)
this model

Dataset used to train open-chess/Caissa-Chess-M1-GGUF