Multi-mode Coconut · gpt2 · ProsQA

This is the Multi-mode Coconut checkpoint trained on ProsQA with the base model openai-community/gpt2, from the paper "Are Latent Reasoning Models Easily Interpretable?" (Dilgren & Wiegreffe, 2026).

Files

This repository contains a single raw PyTorch checkpoint, checkpoint_49, which is the state dict as saved by the training framework. It is not a from_pretrained-style model: the paper's evaluation code builds the base model and then applies this checkpoint to it.
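To look inside the raw checkpoint without the evaluation code, something like the following sketch may help. It assumes the file is a torch.save output that torch.load can read as a mapping from parameter names to tensors, and that it sits at the local path given under Usage. The names CHECKPOINT_PATH, load_raw_state_dict, and summarize are hypothetical helpers and not part of the paper's code.

```python
from pathlib import Path
from typing import Any, Mapping

# Assumed location: the download target described under Usage.
CHECKPOINT_PATH = Path("checkpoints/gpt2_prosqa_multimode/checkpoint_49")


def load_raw_state_dict(path: Path = CHECKPOINT_PATH) -> Mapping[str, Any]:
    """Load the checkpoint on CPU without building any model (hypothetical helper)."""
    import torch  # imported lazily so summarize() works without PyTorch installed

    # weights_only=True restricts unpickling to tensors and plain containers.
    # Assumption: the checkpoint holds nothing beyond those.
    return torch.load(path, map_location="cpu", weights_only=True)


def summarize(state_dict: Mapping[str, Any]) -> list[tuple[str, tuple[int, ...]]]:
    """Return (name, shape) pairs for every entry that exposes a shape."""
    return [
        (name, tuple(value.shape))
        for name, value in state_dict.items()
        if hasattr(value, "shape")
    ]
```

Calling summarize(load_raw_state_dict()) lists parameter names and shapes. This is only an inspection aid, and running the model still requires the paper's evaluation code.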

Usage

The evaluation code in the paper's code repository loads this checkpoint from the local path configured in model_paths.yaml. Download it to the expected location with:

hf download connordilgren/gpt2-prosqa-multimode-coconut checkpoint_49 --local-dir checkpoints/gpt2_prosqa_multimode

This places the file at checkpoints/gpt2_prosqa_multimode/checkpoint_49, which is the path referenced for this model (gpt2_prosqa_multimode_coconut) in model_paths.yaml. See the code repository's README for full setup and evaluation instructions.
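The same download can be done from Python with the huggingface_hub library. This is a minimal sketch, not part of the paper's code: it assumes huggingface_hub is installed, and the constants and function names are hypothetical.

```python
from pathlib import Path

REPO_ID = "connordilgren/gpt2-prosqa-multimode-coconut"
FILENAME = "checkpoint_49"
LOCAL_DIR = Path("checkpoints/gpt2_prosqa_multimode")


def expected_checkpoint_path(local_dir: Path = LOCAL_DIR) -> Path:
    """Path where the checkpoint should end up after the download."""
    return local_dir / FILENAME


def download_checkpoint(local_dir: Path = LOCAL_DIR) -> Path:
    """Fetch checkpoint_49 from the Hub into local_dir and return its path."""
    # Imported lazily so the path helper works without the dependency.
    from huggingface_hub import hf_hub_download

    downloaded = hf_hub_download(
        repo_id=REPO_ID,
        filename=FILENAME,
        local_dir=local_dir,
    )
    return Path(downloaded)
```

After download_checkpoint() returns, the file should exist at expected_checkpoint_path(), matching the location the hf download command produces.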

Citation

@misc{dilgren2026latentreasoningmodelseasily,
      title={Are Latent Reasoning Models Easily Interpretable?},
      author={Connor Dilgren and Sarah Wiegreffe},
      year={2026},
      eprint={2604.04902},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2604.04902},
}