What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Paper • 2503.24235 • Published 2 days ago • 39
AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published 8 days ago • 61
Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation Paper • 2503.22675 • Published 5 days ago • 30
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper • 2503.24290 • Published 2 days ago • 45
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published 8 days ago • 24
FFN Fusion: Rethinking Sequential Computation in Large Language Models Paper • 2503.18908 • Published 9 days ago • 16
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 13 days ago • 65
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 15 days ago • 111
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills Paper • 2503.12533 • Published 17 days ago • 61
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Paper • 2503.12937 • Published 16 days ago • 27
Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published 22 days ago • 13
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 21 days ago • 27
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper • 2503.10639 • Published 20 days ago • 47
Autoregressive Image Generation with Randomized Parallel Decoding Paper • 2503.10568 • Published 20 days ago • 8
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 21 days ago • 361