matlok's Collections
LMM

Papers - Automated Interpretability

OpenAI released a tool in 2024 that refers to this technique: https://github.com/openai/transformer-debugger with https://transformer-circuits.pub/2023/monosema