Submitted by bracio9623 28 Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation · 7 authors 1
Submitted by Dongwei 19 Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback · 5 authors 1
Submitted by russwang 15 ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs · 13 authors 1
Submitted by wchai 11 LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? · 19 authors 1
Submitted by Ziruibest 8 SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning · 8 authors 1
Submitted by LiuXR 7 Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache · 12 authors 1
Submitted by thomasschmied 6 pLSTM: parallelizable Linear Source Transition Mark networks · 5 authors 1
Submitted by jinypark 6 DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO · 4 authors 1
Submitted by cyrilzakka 5 Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards · 12 authors 1
Submitted by kpzhang996 4 A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation · 11 authors 1
Submitted by yxK 4 SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending · 8 authors 1
Submitted by liranringel 3 Learning a Continue-Thinking Token for Enhanced Test-Time Scaling · 3 authors 1
Submitted by lxucs 3 Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings · 6 authors 1
Submitted by bobxwu 3 Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning · 3 authors 1
Submitted by dawn0815 2 Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills · 8 authors 1
Submitted by marksibrahim 2 AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions · 4 authors 1
Submitted by vicgalle 1 Configurable Preference Tuning with Rubric-Guided Synthetic Data · 1 authors 1
Submitted by cjeen 1 LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning · 6 authors 1
Submitted by gabeorlanski 1 Reward Models Enable Scalable Code Verification by Trading Accuracy for Throughput · 4 authors 1
Submitted by Splend1dchan 1 A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data · 8 authors 1
Submitted by ananthu-aniraj 1 Inherently Faithful Attention Maps for Vision Transformers · 4 authors 1
Submitted by ZacLiu 1 Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models · 8 authors 2
Submitted by MingxuanXia 1 Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation · 7 authors 1