sai vignan

vignan

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

Judge Anything: MLLM as a Judge Across Any Modality

upvoted a paper 29 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

upvoted a paper 2 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

View all activity

Organizations

vignan's activity

upvoted 2 papers 29 days ago

Judge Anything: MLLM as a Judge Across Any Modality

Paper • 2503.17489 • Published Mar 21 • 20

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published about 1 month ago • 117

upvoted a paper 2 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 155

upvoted 4 papers 3 months ago

LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published Jan 8 • 37

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 98

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 99

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Paper • 2501.03936 • Published Jan 7 • 20

upvoted a paper 4 months ago

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 53

liked a model 9 months ago

PKU-ONELab/Themis

Text Generation • Updated Feb 22 • 33 • 8

upvoted a paper 9 months ago

Calibrating LLM-Based Evaluator

Paper • 2309.13308 • Published Sep 23, 2023 • 12

upvoted 3 collections 9 months ago

upvoted 2 papers 9 months ago

DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks

Paper • 2309.17167 • Published Sep 29, 2023 • 1

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 123

liked a model 10 months ago

alvdansen/littletinies

Text-to-Image • Updated Jun 16, 2024 • 473 • 399

liked 2 models about 1 year ago

metavoiceio/metavoice-1B-v0.1

Text-to-Speech • Updated Apr 3, 2024 • 582 • 785

vikhyatk/moondream1

Text Generation • Updated Feb 7, 2024 • 68k • 486

upvoted a paper over 1 year ago

Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4

Paper • 2312.16171 • Published Dec 26, 2023 • 37