InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework • Paper • 2504.12395 • Published 7 days ago • 16
Cobra: Efficient Line Art COlorization with BRoAder References • Paper • 2504.12240 • Published 7 days ago • 27
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation • Paper • 2504.07405 • Published 14 days ago • 12
Compass Control: Multi Object Orientation Control for Text-to-Image Generation • Paper • 2504.06752 • Published 15 days ago • 10
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning • Paper • 2504.07960 • Published 13 days ago • 46
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model • Paper • 2504.07615 • Published 14 days ago • 30
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models • Paper • 2504.10479 • Published 9 days ago • 239
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model • Paper • 2504.08685 • Published 12 days ago • 121
TAPNext: Tracking Any Point (TAP) as Next Token Prediction • Paper • 2504.05579 • Published 16 days ago • 5
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting • Paper • 2504.05541 • Published 16 days ago • 16
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance • Paper • 2504.06232 • Published 15 days ago • 12
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation • Paper • 2504.02160 • Published 21 days ago • 35
An Empirical Study of GPT-4o Image Generation Capabilities • Paper • 2504.05979 • Published 15 days ago • 61
OmniSVG: A Unified Scalable Vector Graphics Generation Model • Paper • 2504.06263 • Published 15 days ago • 149