- One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
  Paper • 2306.07967 • Published • 23
- Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
  Paper • 2306.07954 • Published • 111
- TryOnDiffusion: A Tale of Two UNets
  Paper • 2306.08276 • Published • 71
- Seeing the World through Your Eyes
  Paper • 2306.09348 • Published • 30
Collections including paper arxiv:2402.13144
- A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
  Paper • 2310.16656 • Published • 37
- CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
  Paper • 2310.16825 • Published • 28
- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 39
- I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
  Paper • 2311.04145 • Published • 30

- Matryoshka Diffusion Models
  Paper • 2310.15111 • Published • 39
- AToM: Amortized Text-to-Mesh using 2D Diffusion
  Paper • 2402.00867 • Published • 10
- Neural Network Diffusion
  Paper • 2402.13144 • Published • 93
- Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
  Paper • 2402.19479 • Published • 30

- Text-to-3D using Gaussian Splatting
  Paper • 2309.16585 • Published • 29
- FP8-LM: Training FP8 Large Language Models
  Paper • 2310.18313 • Published • 30
- Zephyr: Direct Distillation of LM Alignment
  Paper • 2310.16944 • Published • 116
- Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
  Paper • 2312.06585 • Published • 26

- LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models
  Paper • 2309.15103 • Published • 42
- Neural Network Diffusion
  Paper • 2402.13144 • Published • 93
- Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
  Paper • 2402.10210 • Published • 28
- FiT: Flexible Vision Transformer for Diffusion Model
  Paper • 2402.12376 • Published • 46