Submitted by akhaliq 29 MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer · 9 authors 1
Submitted by akhaliq 28 SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering · 2 authors 3
Submitted by akhaliq 27 HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis · 4 authors 1
Submitted by akhaliq 26 NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation · 3 authors 1
Submitted by akhaliq 19 PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics · 7 authors 1
Submitted by akhaliq 19 Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models · 5 authors 4
Submitted by akhaliq 16 PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction · 9 authors 3
Submitted by akhaliq 12 GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning · 9 authors 1