AnimateAnything: Consistent and Controllable Animation for Video Generation Paper • 2411.10836 • Published 8 days ago • 18
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published 13 days ago • 28
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Paper • 2410.19355 • Published about 1 month ago • 23
VidPanos: Generative Panoramic Videos from Casual Panning Videos Paper • 2410.13832 • Published Oct 17 • 12
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 141
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published Oct 14 • 26
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14 • 52
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation Paper • 2410.07171 • Published Oct 9 • 41
TextToon: Real-Time Text Toonify Head Avatar from Single Video Paper • 2410.07160 • Published Sep 23 • 8
TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation Paper • 2410.05591 • Published Oct 8 • 13
DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion Paper • 2409.17145 • Published Sep 25 • 13
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation Paper • 2409.18964 • Published Sep 27 • 25
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness Paper • 2409.18125 • Published Sep 26 • 33
Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing Paper • 2409.16629 • Published Sep 25 • 10
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 10 days ago • 273
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts Paper • 2409.16040 • Published Sep 24 • 13