Frac-Connections: Fractional Extension of Hyper-Connections • Paper • 2503.14125 • Published 1 day ago • 13
Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation • Paper • 2503.13424 • Published 2 days ago • 18
DAPO: An Open-Source LLM Reinforcement Learning System at Scale • Paper • 2503.14476 • Published 1 day ago • 41
WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes • Paper • 2503.13435 • Published 2 days ago • 16
Personalize Anything for Free with Diffusion Transformer • Paper • 2503.12590 • Published 3 days ago • 33
Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills • Paper • 2503.12533 • Published 3 days ago • 56
DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation • Paper • 2503.06053 • Published 12 days ago • 79
Adversarial Data Collection: Human-Collaborative Perturbations for Efficient and Robust Robotic Imitation Learning • Paper • 2503.11646 • Published 5 days ago • 33
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video • Paper • 2503.11647 • Published 5 days ago • 108
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model • Paper • 2503.07703 • Published 9 days ago • 31
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models • Paper • 2503.08120 • Published 9 days ago • 28
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL • Paper • 2503.07536 • Published 9 days ago • 78
YuE: Scaling Open Foundation Models for Long-Form Music Generation • Paper • 2503.08638 • Published 8 days ago • 57
SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models • Paper • 2503.07605 • Published 9 days ago • 63
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models • Paper • 2503.06749 • Published 10 days ago • 22
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning • Paper • 2503.07365 • Published 9 days ago • 53
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning • Paper • 2503.05379 • Published 12 days ago • 32
Unified Reward Model for Multimodal Understanding and Generation • Paper • 2503.05236 • Published 13 days ago • 105
Iterative Value Function Optimization for Guided Decoding • Paper • 2503.02368 • Published 16 days ago • 14