Sun Donghae

NLPBada

https://blog.naver.com/gypsi12

DonghaeSuh

AI & ML interests

NLP

Organizations

None yet

NLPBada's activity

upvoted a paper 1 day ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 2 days ago • 99

upvoted 2 papers 3 days ago

MAPS: A Multi-Agent Framework Based on Big Seven Personality and Socratic Guidance for Multimodal Scientific Problem Solving

Paper • 2503.16905 • Published 6 days ago • 51

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published 6 days ago • 36

upvoted 2 papers 4 days ago

Inside-Out: Hidden Factual Knowledge in LLMs

Paper • 2503.15299 • Published 8 days ago • 38

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published 6 days ago • 75

upvoted 2 papers 5 days ago

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published 8 days ago • 39

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Paper • 2503.16419 • Published 6 days ago • 61

upvoted 2 papers 7 days ago

DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Paper • 2503.15265 • Published 8 days ago • 43

φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation

Paper • 2503.13288 • Published 9 days ago • 46

upvoted a paper 8 days ago

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published 19 days ago • 95

upvoted a paper 9 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 23 days ago • 77

upvoted a paper 10 days ago

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Paper • 2503.07677 • Published 17 days ago • 80

upvoted 2 papers 12 days ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published 13 days ago • 46

Transformers without Normalization

Paper • 2503.10622 • Published 13 days ago • 138

upvoted a paper 16 days ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 21 days ago • 217

upvoted 2 papers 2 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 112

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 281

upvoted 3 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 270

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 73

How to Synthesize Text Data without Model Collapse?

Paper • 2412.14689 • Published Dec 19, 2024 • 52