Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms Paper โข 2403.17806 โข Published Mar 26, 2024 โข 3
๐ Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized โข 95 items โข Updated 3 days ago โข 96