Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation Paper β’ 2505.18842 β’ Published 28 days ago β’ 37
RLVR Collection Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' β’ 3 items β’ Updated Mar 31 β’ 11
Inference-Time Scaling for Generalist Reward Modeling Paper β’ 2504.02495 β’ Published Apr 3 β’ 55
Running on Zero 104 104 PhotoDoodle Image Edit GPU π Generate edited images using text prompts and styles
Running on Zero 758 758 MMAudio β generating synchronized audio from video/text π Generate audio from video or text prompts
Running 541 541 Open Source Ai Year In Review 2024 π» What happened in open-source AI this year, and whatβs next?