Anton Razzhigaev's picture

6 6

Anton Razzhigaev

razzant

·

https://t.me/abstractDL

razzant

AI & ML interests

Language models, multimodal models, knowledge graphs, chatbots

Recent Activity

authored a paper 2 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

upvoted a paper 2 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

authored a paper 16 days ago

Universal Adversarial Attack on Aligned Multimodal LLMs

View all activity

Organizations

razzant's activity

upvoted a paper 2 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 3 days ago • 99

upvoted a paper 3 months ago

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published Jan 7 • 14

upvoted a paper 8 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 114

upvoted a paper 9 months ago

The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing

Paper • 2406.10601 • Published Jun 15, 2024 • 69

upvoted a collection 9 months ago

🔍 Interpretability & Analysis of LMs

Outstanding research in LM interpretability and evaluation, summarized • 105 items • Updated 14 days ago • 97

upvoted a paper 10 months ago

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 54