16 89 20

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 1 day ago

Inference-Time Scaling for Generalist Reward Modeling

upvoted a collection 6 days ago

BGE

upvoted a paper 17 days ago

Why Do Multi-Agent LLM Systems Fail?

View all activity

Organizations

dongguanting's activity

upvoted a paper 1 day ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 5 days ago • 42

upvoted a collection 6 days ago

BGE

Collection

23 items • Updated Feb 13 • 104

upvoted a paper 17 days ago

Why Do Multi-Agent LLM Systems Fail?

Paper • 2503.13657 • Published 22 days ago • 42

upvoted a paper 21 days ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 105

upvoted 2 papers about 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 100

RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Paper • 2412.12881 • Published Dec 17, 2024 • 2

upvoted 14 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published Jan 3 • 34

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published Jan 2 • 27

MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval

Paper • 2412.14475 • Published Dec 19, 2024 • 55

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 67

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44