AsyncTool: Evaluating the Asynchronous Function Calling Capability under Multi-Task Scenarios Paper • 2605.27995 • Published 18 days ago • 16
SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training Paper • 2605.08738 • Published May 9 • 13
DCAgent2/swebench_verified_random_100_folders_tezos100k_continue_tezos_step900__Qwen3_32d3b47da5 Viewer • Updated May 7 • 300 • 24 • 1
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published May 3 • 167
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 506
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 327