Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2404.07129

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized

LLM Circuit Analyses Are Consistent Across Training and Scale

Paper • 2407.10827 • Published 7 days ago • 4
Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting

Paper • 2406.00053 • Published May 28 • 1
Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

Paper • 2406.20086 • Published 24 days ago • 3
Multi-property Steering of Large Language Models with Dynamic Activation Composition

Paper • 2406.17563 • Published 27 days ago • 4

Papers - ICL - Residual Head Hypothesis

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - XAI - Attention - LayerNorm

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - ICL - Phase Change Delay - Large Vocabulary Size

Larger vocab is better compression, but may result in longer training ICL phase change delays due to the slower Induction Head Copy Subcircuit (C)

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - ICL - Phase Change - Delay - Classes and Labels

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - ICL - Induction Head - Copy vs QK Match

See figure 6: Classes vs labels in columns B and C. Subcircuit B delays phase change on number classes vs C delays on number of labels (dramatically)

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - Training - ICL - Induction Circuit Evolution

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - ICL - Induction Circuit - Data Dependent Learning

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - ICL - Induction Head - Num Labels vs Classes - Loss

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3

Papers - XAI - Induction Head - Phase Change - Components

What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

Paper • 2404.07129 • Published Apr 10 • 3
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

Paper • 2403.07809 • Published Mar 12 • 1

Previous
1
2
3
4
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs