Chreode: A Cell World Model for One-Step Temporal Dynamics and Perturbation Prediction
Paper • 2605.28111 • Published
Pretrained weights for Chreode, a one-step cell world model published at NeurIPS 2026.
Paper: arXiv:2605.28111 · Code: github.com/mufanq/Chreode
| File | Stage | Architecture | Size |
|---|---|---|---|
vae.pt |
Stage 1 | scVI encoder–decoder; latent 128; hidden 512; 3 enc + 3 dec layers; Normal likelihood | 647 MB |
dynamics_dit.pt |
Stage 2 | Waddington-DiT (Small: hidden 384, depth 12, 6 heads, 4 register tokens); experiment g2a_m10_wdit_time2vecu_lowfreqcurl_uncertainty_adamw |
472 MB |
static_dit.pt |
Stage 2 control | Same architecture as dynamics_dit.pt but trained with reconstruction-only objective; used as the control arm for §5.3 (fate) and §5.4 (Norman) |
472 MB |
from huggingface_hub import snapshot_download
import torch
ckpt_dir = snapshot_download(repo_id="WhenceFade/chreode-pretrained")
vae = torch.load(f"{ckpt_dir}/vae.pt", map_location="cpu", weights_only=False)
dynamics_dit = torch.load(f"{ckpt_dir}/dynamics_dit.pt", map_location="cpu", weights_only=False)
static_dit = torch.load(f"{ckpt_dir}/static_dit.pt", map_location="cpu", weights_only=False)
End-to-end loader and the full latent → prediction example are in the companion GitHub repo; see reproduce/01_pretrain.md for the exact config, and reproduce/00_setup.md for environment setup.
normalize_total(1e4) + log1p. Cached preprocessing artifacts: WhenceFade/chreode-phase0.| Stage 1 (VAE) | Stage 2 (W-DiT) | |
|---|---|---|
| Steps | 1,678 (≈ 2 epochs) | 3,356 |
| Batch | 4,096 | 512 |
| Optimizer | Adam (scvi-tools defaults) | AdamW β=(0.9, 0.95), wd=0.01 |
| LR | scvi defaults | 3 × 10⁻⁴, 5% cosine warmup |
| Loss | ELBO (Normal) | MMD + Sinkhorn W₂ + drift + downhill (1 : 1 : 1 : 0.1) |
| Hardware | 1 × A100 | 1 × A100 |
| Wall-clock | ≈ 12 h | ≈ 18 h |
When this backbone is plugged into the downstream evaluation in mufanq/Chreode:
| Task | Metric | Chreode | Best baseline |
|---|---|---|---|
| Weinreb d6 fine-tune | Sinkhorn W₂ ↓ | 1.688 ± 0.036 | PI-SDE 1.840 |
| Veres avg t1–t7 fine-tune | Sinkhorn W₂ ↓ | 2.617 | PI-SDE 2.830 |
| Weinreb fate zero-shot | Pearson r ↑ | 0.468 | scDiffEq 0.463 |
| Norman GEARS embedding replace | DE20 MSE ↓ | 0.18580 (−12.4%) | GEARS 0.21208 |
| Inference latency (A100 fp32 b1) | ms / NFE | 65 ms / 1 | PRESCIENT 194 / many |
Three downstream tasks include fine-tuning; the fate task is zero-shot.
reproduce/known_issues.md.do(a) notationally. For mechanistic claims, treat predictions as hypotheses, not endpoints.MIT — see the GitHub repository.
@inproceedings{qiu2026chreode,
title = {Chreode: A Cell World Model for One-Step Temporal Dynamics and Perturbation Prediction},
author = {Qiu, Mufan and Zheng, Genhui and Xu, Yinuo and Zhang, Ruichen and Ding, Ying and Long, Qi and Chen, Tianlong},
booktitle = {Advances in Neural Information Processing Systems},
year = {2026},
eprint = {2605.28111},
archivePrefix = {arXiv},
primaryClass = {q-bio.QM}
}