LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards Paper • 2605.31584 • Published 21 days ago • 41
STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media Paper • 2605.25162 • Published 26 days ago • 4
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 23 days ago • 424