Claw-SWE-Bench: A Benchmark for Evaluating OpenClaw-style Agent Harnesses on Coding Tasks • Paper • 2606.12344 • Published 4 days ago • 62
usermma/Huihui-MiniCPM5-1B-abliterated-mlx-4Bit • Text Generation • 0.2B • Updated 10 days ago • 60 • 1
Joint Training of Multi-Token Prediction in Reinforcement Learning via Optimal Coefficient Calibration • Paper • 2605.28184 • Published 18 days ago • 6
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? • Paper • 2605.22109 • Published 24 days ago • 169
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time • Paper • 2604.11626 • Published Apr 13 • 102
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling • Paper • 2603.25746 • Published Mar 26 • 155
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models • Paper • 2603.16859 • Published Mar 17 • 249