Submitted by MiniMax-AI 252 MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention · 127 authors 2.62k 5
Submitted by schrodingers-tiger 70 Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning · 27 authors 4
Submitted by Ayanami0730 63 DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents · 5 authors 185 4
Submitted by shuaishuaicdp 49 Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency · 6 authors 2
Submitted by zhendch 46 Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression · 8 authors 124 2
Submitted by shulin16 43 Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning · 10 authors 79 2
Submitted by rp-yu 41 Discrete Diffusion in Large Language and Multimodal Models: A Survey · 3 authors 160 3
Submitted by jingyq1 29 AR-RAG: Autoregressive Retrieval Augmentation for Image Generation · 4 authors 2
Submitted by zihanliu 24 AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy · 7 authors 4
Submitted by WTNswaggy 21 PersonaFeedback: A Large-scale Human-annotated Benchmark For Personalization · 6 authors 2
Submitted by IgnoraZ 16 From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding · 4 authors 5 2
Submitted by LPY 12 BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models · 9 authors 92 2
Submitted by iwiwi 7 ALE-Bench: A Benchmark for Long-Horizon Objective-Driven Algorithm Engineering · 6 authors 97 2
Submitted by pranavAL2109 6 Supernova Event Dataset: Interpreting Large Language Model's Personality through Critical Event Analysis · 2 authors 5 2
Submitted by viswa-98 5 LETS Forecast: Learning Embedology for Time Series Forecasting · 5 authors 5 3
Submitted by Owenngt 4 SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance · 8 authors 2
Submitted by mkshing 3 DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion · 2 authors 2
Submitted by Franck-Dernoncourt 3 Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition · 10 authors 2
Submitted by Franck-Dernoncourt 3 MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos · 8 authors 2
Submitted by zainmujahid 3 Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts · 4 authors 2 2
Submitted by Taegyeonglee 3 QGuard: Question-based Zero-shot Guard for Multi-modal LLM Safety · 5 authors 2
Submitted by liujch1998 3 Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index · 5 authors 2
Submitted by PChemGuy 1 AI-Facilitated Analysis of Abstracts and Conclusions: Flagging Unsubstantiated Claims and Ambiguous Pronouns · 1 author 2
Submitted by ChristianAzinn 1 Personalizable Long-Context Symbolic Music Infilling with MIDI-RWKV · 2 authors 2