Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases Paper • 2312.15011 • Published Dec 22, 2023 • 15
LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation Paper • 2408.13252 • Published Aug 23 • 23
FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models Paper • 2412.07674 • Published 5 days ago • 20
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation Paper • 2409.18261 • Published Sep 26
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper • 2407.06191 • Published Jul 8 • 11
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published 13 days ago • 37
Imagine360: Immersive 360 Video Generation from Perspective Anchor Paper • 2412.03552 • Published 10 days ago • 26