Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Paper • 2503.24377 • Published 24 days ago • 17
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts Paper • 2503.22952 • Published 26 days ago • 18
Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data Paper • 2503.21694 • Published 28 days ago • 16
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper • 2504.01724 • Published 22 days ago • 64
PaperBench: Evaluating AI's Ability to Replicate AI Research Paper • 2504.01848 • Published 22 days ago • 36
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Paper • 2504.01934 • Published 22 days ago • 23
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization Paper • 2504.00999 • Published 23 days ago • 83
Articulated Kinematics Distillation from Video Diffusion Models Paper • 2504.01204 • Published 23 days ago • 24