-
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
Paper • 2401.15687 • Published • 21 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper • 2312.03029 • Published • 23 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper • 2312.13578 • Published • 25 -
Splatter Image: Ultra-Fast Single-View 3D Reconstruction
Paper • 2312.13150 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2402.13251
-
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper • 2401.13601 • Published • 44 -
A Touch, Vision, and Language Dataset for Multimodal Alignment
Paper • 2402.13232 • Published • 13 -
Neural Network Diffusion
Paper • 2402.13144 • Published • 94 -
FlashTex: Fast Relightable Mesh Texturing with LightControlNet
Paper • 2402.13251 • Published • 13
-
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 12 -
Sketch2NeRF: Multi-view Sketch-guided Text-to-3D Generation
Paper • 2401.14257 • Published • 9 -
pix2gestalt: Amodal Segmentation by Synthesizing Wholes
Paper • 2401.14398 • Published • 8 -
AGG: Amortized Generative 3D Gaussians for Single Image to 3D
Paper • 2401.04099 • Published • 8
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 9 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 12 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 21 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 6
-
deepseek-ai/deepseek-coder-6.7b-base
Text Generation • Updated • 18.6k • 80 -
vikhyatk/moondream1
Text Generation • Updated • 212k • 474 -
162😽
Whisper Speech X DreamTalk
Combine voice cloning and portrait lipsync animation
-
FlashTex: Fast Relightable Mesh Texturing with LightControlNet
Paper • 2402.13251 • Published • 13
-
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction
Paper • 2311.12024 • Published • 18 -
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Paper • 2311.13141 • Published • 12 -
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor
Paper • 2312.07661 • Published • 16 -
HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs
Paper • 2312.14140 • Published • 6
-
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
Paper • 2310.12474 • Published • 5 -
Drivable 3D Gaussian Avatars
Paper • 2311.08581 • Published • 46 -
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
Paper • 2311.12775 • Published • 28 -
Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
Paper • 2311.13141 • Published • 12