Submitted by Hush-cd 84 xVerify: Efficient Answer Verifier for Reasoning Model Evaluations · 9 authors 2
Submitted by xufangzhi 53 Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning · 9 authors 2
Submitted by zhoutianyi 39 How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients · 4 authors 2
Submitted by wbhu-tc 17 NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors · 5 authors 2
Submitted by IanMagnusson 16 DataDecide: How to Predict Best Pretraining Data with Small Experiments · 13 authors 2
Submitted by LXT 15 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer · 7 authors 3
Submitted by weqweasdas 14 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce · 11 authors 4
Submitted by Daniel0724 12 SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL · 7 authors 1
Submitted by SempraETY 12 Efficient Generative Model Training via Embedded Representation Warmup · 4 authors 2
Submitted by CoreloneH 12 D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation · 5 authors 2
Submitted by davanstrien 11 DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning · 15 authors 6
Submitted by pierlj 11 RealHarm: A Collection of Real-World Language Model Application Failures · 4 authors 3
Submitted by jrd971000 10 Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning · 18 authors 2
Submitted by yueqis 10 VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge · 6 authors 2
Submitted by simocimolato 8 AI-University: An LLM-based platform for instructional alignment to scientific classrooms · 8 authors 2
Submitted by HenghuiDing 6 PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild · 36 authors 2
Submitted by SYZhang0805 5 Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion · 8 authors 2
Submitted by Hoar012 4 Multimodal Long Video Modeling Based on Temporal Dynamic Context · 4 authors 2
Submitted by sukannya 3 LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews · 5 authors 2
Submitted by gigant 3 Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure · 3 authors 2
Submitted by ziqipang 2 Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception · 3 authors 2
Submitted by ElmanGhazaei 1 Change State Space Models for Remote Sensing Change Detection · 2 authors 2