Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution Paper • 2312.06640 • Published Dec 11, 2023 • 46
Generic 3D Diffusion Adapter Using Controlled Multi-View Editing Paper • 2403.12032 • Published Mar 18 • 14
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion Paper • 2403.12008 • Published Mar 18 • 19
OpenCulture Collection A multilingual dataset of public domain books and newspapers. • 27 items • Updated 14 days ago • 117
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions Paper • 2402.17485 • Published Feb 27 • 189
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 59
Depth Anything Release Collection Depth Anything models, foundation models for monocular depth estimation, trained on 1.5 million labeled images and 62 million unlabeled images • 8 items • Updated Jan 26 • 9
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models Paper • 2310.11954 • Published Oct 18, 2023 • 24
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity Paper • 2101.03961 • Published Jan 11, 2021 • 14