Submitted by Benjamin-eecs 27 SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning · 12 authors 40 1
Submitted by jianzongwu 25 VMoBA: Mixture-of-Block Attention for Video Diffusion Models · 8 authors 19 1
Submitted by alexgambashidze 19 Listener-Rewarded Thinking in VLMs for Image Preferences · 8 authors 1
Submitted by Jianyu 16 Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective · 3 authors 2
Submitted by a-yakovenko 15 MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation · 4 authors 20 2
Submitted by wanhaoliu 12 Consistent Time-of-Flight Depth Denoising via Graph-Informed Geometric Attention · 4 authors 2
Submitted by Skhaki 11 SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity · 10 authors 23 2
Submitted by Mingyuan1997 9 Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling? · 8 authors 1
Submitted by mdmoor 7 MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning · 4 authors 3 4
Submitted by najoungkim 6 RExBench: Can coding agents autonomously implement AI research extensions? · 7 authors 1 1
Submitted by JJ-TMT 5 UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and Understanding · 5 authors 6 1
Submitted by liuhuadai 4 ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing · 7 authors 39 1
Submitted by RaghavvGoel 3 VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs · 12 authors 1
Submitted by jmprcp 3 Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs · 7 authors 2
Submitted by XiaoyunYuan 2 Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography · 5 authors 1 1