Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior Paper • 2303.14184 • Published Mar 24, 2023
StyleSwin: Transformer-based GAN for High-resolution Image Generation Paper • 2112.10762 • Published Dec 20, 2021
MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding Paper • 2406.04264 • Published Jun 6 • 1
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence Paper • 2407.16655 • Published Jul 23 • 28
DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation Paper • 2405.15619 • Published May 24
DeepSeek-VL: Towards Real-World Vision-Language Understanding Paper • 2403.05525 • Published Mar 8 • 39
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Paper • 2310.16818 • Published Oct 25, 2023 • 30
Vector Quantized Diffusion Model for Text-to-Image Synthesis Paper • 2111.14822 • Published Nov 29, 2021
Paint by Example: Exemplar-based Image Editing with Diffusion Models Paper • 2211.13227 • Published Nov 23, 2022 • 2