SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters • Paper • 2412.00174 • Published 5 days ago • 13
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM • Paper • 2411.04954 • Published 27 days ago • 8
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images • Paper • 2411.05738 • Published 26 days ago • 14
KMM: Key Frame Mask Mamba for Extended Motion Generation • Paper • 2411.06481 • Published 24 days ago • 4
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision • Paper • 2411.07199 • Published 23 days ago • 44
MagicQuill: An Intelligent Interactive Image Editing System • Paper • 2411.09703 • Published 20 days ago • 56
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement • Paper • 2411.06558 • Published 24 days ago • 34