arxiv:2411.08790
Harry Mayne
HarryMayne
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
6 days ago
Can sparse autoencoders be used to decompose and interpret steering
vectors?
authored
a paper
10 days ago
Can sparse autoencoders be used to decompose and interpret steering
vectors?
upvoted
a
paper
12 days ago
Ablation is Not Enough to Emulate DPO: How Neuron Dynamics Drive
Toxicity Reduction
Organizations
None yet
Papers
1
models
None public yet
datasets
None public yet