CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Paper • 2501.11325 • Published Jan 20 • 5
Towards Diverse and Efficient Audio Captioning via Diffusion Models Paper • 2409.09401 • Published Sep 14, 2024 • 7
gsplat: An Open-Source Library for Gaussian Splatting Paper • 2409.06765 • Published Sep 10, 2024 • 16
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Paper • 2409.07452 • Published Sep 11, 2024 • 20