4X Mamba World Model

Current 4X strategy-game Mamba world model checkpoint.

It is intentionally minimal:

  • model.safetensors - model weights only
  • config.json - architecture/config metadata
  • metrics.json - training/evaluation metrics from the final head-training run

No training corpus, infrastructure scripts, simulator source, benchmark harness, optimizer state, scheduler state, or unrelated project files are included.

Clean loader/model code is here:

https://github.com/ProjectAI00/4x-mamba

Model

  • Architecture: MambaWorldModel
  • Parameters: 28,709,251
  • Hidden size: 512
  • Layers: 8
  • Max state tokens: 512
  • Action vocabulary: 4096
  • Token vocabulary: 4096

The model encodes canonical game state/action tokens, applies Mamba-style SSM dynamics in latent space, predicts the next latent, and scores actions with policy/reward/value heads.

The public GitHub implementation is PyTorch. It can run on CUDA through PyTorch (model.cuda()), but it does not vendor the fused Mamba/Triton backend.

The model was trained on synthetic traces from a 4X strategy-game simulator.

Reference Metrics

These numbers are included to document the released checkpoint, not as a complete benchmark suite.

Offline action-ranking metrics from metrics.json:

  • Validation top-1: 0.3379
  • Validation top-3: 0.6924
  • Validation MRR: 0.5525
  • Test top-1: 0.2881
  • Test top-3: 0.6416
  • Test MRR: 0.5120

Small head-to-head simulator checks from the associated run:

  • Current model vs best greedy bot, 100 games, seed slice 390515: 34% win rate
  • Current model vs best greedy bot, 100 games, seed slice 640515: 42% win rate

Notes

This checkpoint is provided as open research weights. It includes the trained model weights, config, model card, and run metrics only.

Downloads last month
74
Safetensors
Model size
28.7M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support