Submitted by Junteng 55 WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents · 15 authors 1
Submitted by Lingaaaaaaa 37 Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models · 6 authors 89 4
Submitted by wenjun-li 22 Reinforcement Learning Foundations for Deep Research Systems: A Survey · 11 authors 10 1
Submitted by YuyaoGe 14 Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning · 9 authors 1
Submitted by dorni 14 UniVerse-1: Unified Audio-Video Generation via Stitching of Experts · 10 authors 42 1
Submitted by taesiri 12 Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents · 4 authors 1
Submitted by glecorve 11 DivMerge: A divergence-based model merging method for multi-tasking · 4 authors 1
Submitted by lioooox 10 Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? · 9 authors 12 1
Submitted by MElHuseyni 5 Guided Decoding and Its Critical Role in Retrieval-Augmented Generation · 7 authors 1
Submitted by taesiri 5 Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers · 5 authors 1
Submitted by JamesXZ 4 Test-Time Scaling in Reasoning Models Is Not Effective for Knowledge-Intensive Tasks Yet · 3 authors 3 1
Submitted by Youbang 3 R^textbf{2AI}: Towards Resistant and Resilient AI in an Evolving World · 5 authors 1
Submitted by stefan-it 3 Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian · 8 authors 1
Submitted by UVSKKR 2 D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning · 6 authors 3 1
Submitted by lgy0404 2 MAS-Bench: A Unified Benchmark for Shortcut-Augmented Hybrid Mobile GUI Agents · 11 authors 8 1
Submitted by cxiong 2 SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents · 7 authors 1
Submitted by bearhaon 2 Mechanistic interpretability for steering vision-language-action models · 4 authors 1
Submitted by sileod 1 Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem · 2 authors 4 1
Submitted by xchu123 1 DCReg: Decoupled Characterization for Efficient Degenerate LiDAR Registration · 6 authors 38 1
Submitted by LuJingyi 1 Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping · 2 authors 18 1
Submitted by TahaKoleilat 1 Singular Value Few-shot Adaptation of Vision-Language Models · 3 authors 5 1