Submitted by zhen-nan 92 TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes · 8 authors 3
Submitted by akhaliq 61 Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model · 6 authors 3
Submitted by DonJoey 51 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models · 10 authors 2
Submitted by Wizardcoast 37 Unicorn: Text-Only Data Synthesis for Vision Language Model Training · 10 authors 2
Submitted by lianganimation 35 TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization · 8 authors 3
Submitted by vanilla1116 29 RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy · 7 authors 2
Submitted by tongwu2020 18 Effectively Controlling Reasoning Models through Thinking Intervention · 4 authors 4
Submitted by ZhiyuanthePony 16 Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data · 6 authors 2
Submitted by jianguozhang 12 ActionStudio: A Lightweight Framework for Data and Training of Large Action Models · 16 authors 2
Submitted by JimmyMa99 11 TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection · 10 authors 2
Submitted by abcorrea 10 Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code · 3 authors 1
Submitted by rover-xingyu 7 Easi3R: Estimating Disentangled Motion from DUSt3R Without Training · 5 authors 2
Submitted by Lp256 7 MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs · 8 authors 2
Submitted by akhaliq 6 DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness · 4 authors 2
Submitted by 77Hui 6 UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation · 10 authors 2
Submitted by lastdefiance20 4 KOFFVQA: An Objectively Evaluated Free-form VQA Benchmark for Large Vision-Language Models in the Korean Language · 2 authors 2
Submitted by ZhenyuLiang 4 Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization · 5 authors 3