Submitted by Daoguang 37 Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving · 19 authors 2
Submitted by Ningyu 16 SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement · 11 authors 2
Submitted by BestWishYsh 13 VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning · 8 authors 2
Submitted by Zhaorun 13 ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning · 3 authors 2
Submitted by yifanzhang114 11 MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models · 9 authors 4
Submitted by akhaliq 11 APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay · 15 authors 2
Submitted by akhaliq 9 Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization · 12 authors 2
Submitted by akhaliq 8 HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration · 9 authors 2
Submitted by andito 6 Slow-Fast Architecture for Video Multi-Modal Large Language Models · 9 authors 2
Submitted by yyzqy 5 EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling · 9 authors 2
Submitted by alokabhishek 5 BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models · 3 authors 2
Submitted by bmay 4 Real-is-Sim: Bridging the Sim-to-Real Gap with a Dynamic Digital Twin for Real-World Robot Policy Evaluation · 7 authors 2
Submitted by nielsr 4 Delineate Anything: Resolution-Agnostic Field Boundary Delineation on Satellite Imagery · 7 authors 2
Submitted by ChaosLiao 2 SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning · 9 authors 2