Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published 7 days ago • 27
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 1 day ago • 51
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 2 days ago • 62
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 1 day ago • 75
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 1 day ago • 97
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published 15 days ago • 22
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation Paper • 2503.22194 • Published 13 days ago • 23
Expanding RL with Verifiable Rewards Across Diverse Domains Paper • 2503.23829 • Published 10 days ago • 18
SketchVideo: Sketch-based Video Generation and Editing Paper • 2503.23284 • Published 11 days ago • 22