Junyi42/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt Image-to-3D • Updated 24 days ago • 5.09k • 13
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4 • 17 • 3
CameraCtrl: Enabling Camera Control for Text-to-Video Generation Paper • 2404.02101 • Published Apr 2 • 22
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29 • 32
stabilityai/stable-video-diffusion-img2vid-xt-1-1 Image-to-Video • Updated Jul 10 • 21.1k • 761
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models Paper • 2303.11589 • Published Mar 21, 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence Paper • 2305.15347 • Published May 24, 2023