Search-o1: Agentic Search-Enhanced Large Reasoning Models • Paper • 2501.05366 • Published 3 days ago • 47
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs • Paper • 2412.21187 • Published 13 days ago • 34
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents • Paper • 2310.11667 • Published Oct 18, 2023 • 3
SDPO: Segment-Level Direct Preference Optimization for Social Agents • Paper • 2501.01821 • Published 9 days ago • 18
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM • Paper • 2501.01904 • Published 9 days ago • 28
Test-time Computing: from System-1 Thinking to System-2 Thinking • Paper • 2501.02497 • Published 7 days ago • 33
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning • Paper • 2501.03226 • Published 6 days ago • 33