GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers Paper • 2503.19480 • Published 29 days ago • 16
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models Paper • 2503.20240 • Published 28 days ago • 22
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 28 days ago • 46
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 28 days ago • 49
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? Paper • 2503.19990 • Published 28 days ago • 34
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Paper • 2503.19757 • Published 28 days ago • 50
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 64