CODI · gpt2 · ProsQA

This is the CODI checkpoint trained on ProsQA with base model openai-community/gpt2, from the paper Are Latent Reasoning Models Easily Interpretable? (Dilgren & Wiegreffe, 2026).

📄 Paper: https://arxiv.org/abs/2604.04902
💻 Code: https://github.com/connordilgren/are-lrms-easily-interpretable
📚 Collection (all checkpoints): https://huggingface.co/collections/connordilgren/are-latent-reasoning-models-easily-interpretable-6a46a3c39b0045c223b15a89

Files

This repository contains a single raw PyTorch checkpoint, pytorch_model.bin, which is the state dict as saved by the training framework. It cannot be loaded with from_pretrained; instead, the paper's evaluation code builds the base model and then applies this checkpoint.
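To check what the file contains before running the evaluation code, a minimal sketch along these lines may help. It assumes the checkpoint is a plain mapping from parameter names to tensors; the key names themselves are not documented here, and load_state_dict and summarize_prefixes are illustrative helper names, not part of the paper's code.

```python
from collections import Counter
from pathlib import Path


def load_state_dict(path):
    """Load the raw checkpoint onto the CPU.

    Assumption: the file is a plain state dict (names -> tensors).
    weights_only=True refuses to unpickle arbitrary objects, so this
    call raises if the file holds anything else.
    """
    import torch  # imported lazily so summarize_prefixes works without PyTorch

    return torch.load(Path(path), map_location="cpu", weights_only=True)


def summarize_prefixes(state_dict, depth=1):
    """Count entries per leading key prefix (first `depth` dotted parts)."""
    counts = Counter()
    for name in state_dict:
        prefix = ".".join(name.split(".")[:depth])
        counts[prefix] += 1
    return dict(counts)


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the file was downloaded.
    state = load_state_dict("pytorch_model.bin")
    for prefix, n in sorted(summarize_prefixes(state).items()):
        print(f"{prefix}: {n} entries")
```

Grouping keys by prefix gives a quick view of which submodules the checkpoint covers, which can help when matching it against the model that the evaluation code constructs.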

Usage

The evaluation code in the repository loads this checkpoint from the local path configured in model_paths.yaml. Download it to the expected location with:

hf download connordilgren/gpt2-prosqa-codi pytorch_model.bin --local-dir checkpoints/codi_trained_models/prosqa_gpt2_latent_baseline/gpt2/ep_40/lr_0.003/seed_11

This places the file at checkpoints/codi_trained_models/prosqa_gpt2_latent_baseline/gpt2/ep_40/lr_0.003/seed_11/pytorch_model.bin, which is the path referenced for this model (gpt2 → prosqa → codi) in model_paths.yaml. See the repository README for full setup and evaluation instructions.
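The same download can be scripted from Python. The sketch below uses hf_hub_download from the huggingface_hub library with the repository ID, file name, and directory given above; expected_checkpoint_path and ensure_checkpoint are hypothetical helper names, and the root argument is an assumption that the working directory is the root of the code repository.

```python
from pathlib import Path

REPO_ID = "connordilgren/gpt2-prosqa-codi"
FILENAME = "pytorch_model.bin"
# Directory taken from the download command above.
LOCAL_DIR = Path(
    "checkpoints/codi_trained_models/prosqa_gpt2_latent_baseline"
    "/gpt2/ep_40/lr_0.003/seed_11"
)


def expected_checkpoint_path(root="."):
    """Return where the checkpoint should sit, relative to `root`."""
    return Path(root) / LOCAL_DIR / FILENAME


def ensure_checkpoint(root="."):
    """Download the checkpoint only if it is not already in place."""
    target = expected_checkpoint_path(root)
    if not target.is_file():
        # Lazy import: only needed when a download is actually required.
        from huggingface_hub import hf_hub_download

        hf_hub_download(
            repo_id=REPO_ID,
            filename=FILENAME,
            local_dir=Path(root) / LOCAL_DIR,
        )
    return target


if __name__ == "__main__":
    print(ensure_checkpoint())
```

Checking for the file first keeps repeated runs from contacting the Hub when the checkpoint is already present.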

Citation

@misc{dilgren2026latentreasoningmodelseasily,
      title={Are Latent Reasoning Models Easily Interpretable?},
      author={Connor Dilgren and Sarah Wiegreffe},
      year={2026},
      eprint={2604.04902},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2604.04902},
}
