CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM Paper • 2411.04954 • Published 26 days ago • 8
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images Paper • 2411.05738 • Published 25 days ago • 14
KMM: Key Frame Mask Mamba for Extended Motion Generation Paper • 2411.06481 • Published 23 days ago • 4
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision Paper • 2411.07199 • Published 22 days ago • 44
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published 19 days ago • 56
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper • 2411.06558 • Published 23 days ago • 34
Sora Reference Papers Collection A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Oct 3 • 52
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding Paper • 2312.04461 • Published Dec 7, 2023 • 57
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models Paper • 2308.06721 • Published Aug 13, 2023 • 29
StarVector: Generating Scalable Vector Graphics Code from Images Paper • 2312.11556 • Published Dec 17, 2023 • 27