Collections
Discover the best community collections!
Collections including paper arxiv:2312.11392
-
PALP: Prompt Aligned Personalization of Text-to-Image Models
Paper β’ 2401.06105 β’ Published β’ 46 -
Image Sculpting: Precise Object Editing with 3D Geometry Control
Paper β’ 2401.01702 β’ Published β’ 18 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper β’ 2312.11392 β’ Published β’ 18
-
aMUSEd: An Open MUSE Reproduction
Paper β’ 2401.01808 β’ Published β’ 26 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper β’ 2401.01885 β’ Published β’ 26 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper β’ 2401.00604 β’ Published β’ 4 -
LARP: Language-Agent Role Play for Open-World Games
Paper β’ 2312.17653 β’ Published β’ 29
-
Gemini: A Family of Highly Capable Multimodal Models
Paper β’ 2312.11805 β’ Published β’ 44 -
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
Paper β’ 2312.13314 β’ Published β’ 6 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper β’ 2312.11514 β’ Published β’ 253 -
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit
Paper β’ 2312.09911 β’ Published β’ 51
-
StarVector: Generating Scalable Vector Graphics Code from Images
Paper β’ 2312.11556 β’ Published β’ 26 -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Paper β’ 2312.12423 β’ Published β’ 12 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper β’ 2312.11392 β’ Published β’ 18 -
stabilityai/stable-video-diffusion-img2vid-xt
Image-to-Video β’ Updated β’ 139k β’ 2.25k
-
GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis
Paper β’ 2312.11458 β’ Published β’ 4 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper β’ 2312.11392 β’ Published β’ 18 -
LangSplat: 3D Language Gaussian Splatting
Paper β’ 2312.16084 β’ Published β’ 14 -
Human101: Training 100+FPS Human Gaussians in 100s from 1 View
Paper β’ 2312.15258 β’ Published β’ 6
-
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper β’ 2312.11392 β’ Published β’ 18 -
MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising
Paper β’ 2312.10899 β’ Published β’ 14 -
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
Paper β’ 2311.17002 β’ Published β’ 5 -
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
Paper β’ 2311.15773 β’ Published β’ 4