-
LLM Circuit Analyses Are Consistent Across Training and Scale
Paper • 2407.10827 • Published • 4 -
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
Paper • 2406.00053 • Published • 1 -
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs
Paper • 2406.20086 • Published • 3 -
Multi-property Steering of Large Language Models with Dynamic Activation Composition
Paper • 2406.17563 • Published • 4
Collections
Discover the best community collections!
Collections including paper arxiv:2404.07129