Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published 6 days ago • 33
Material Anything: Generating Materials for Any 3D Object via Diffusion Paper • 2411.15138 • Published 6 days ago • 40
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 13 days ago • 105
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28 • 83