3 11 7

Yu Zhao

yuzhaouoe

https://yuzhaouoe.github.io/

AI & ML interests

NLP/ML

Recent Activity

updated a collection 21 days ago

SAE-Based Representation Engineering

updated a collection 21 days ago

SAE-Based Representation Engineering

updated a collection 21 days ago

SAE-Based Representation Engineering

Organizations

yuzhaouoe's activity

upvoted a paper 26 days ago

DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Paper • 2410.18860 • Published 29 days ago • 8

upvoted 2 papers 28 days ago

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Paper • 2410.16090 • Published Oct 21 • 7

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21 • 19

upvoted a paper about 2 months ago

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9 • 37

upvoted an article 3 months ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21

• 22

upvoted a collection 5 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Collection

Outstanding research in interpretability and evaluation of language models, summarized • 83 items • Updated about 8 hours ago • 91

upvoted an article 5 months ago

Article

Introducing RWKV — An RNN with the advantages of a transformer

May 15, 2023

• 14

upvoted 3 papers 5 months ago

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Paper • 2406.13663 • Published Jun 19 • 7

A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression

Paper • 2406.11430 • Published Jun 17 • 23

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6 • 37

upvoted an article 8 months ago

Article

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Jan 29

• 15