Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
DmitryRyuminΒ 
posted an update Mar 14
Post
πŸš€πŸŽ­πŸŒŸ New Research Alert! 🌟🎭 πŸš€
πŸ“„ Title: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis πŸŒŸπŸš€

πŸ“ Description: VLOGGER is a method for text- and audio-driven generation of talking human video from a single input image of a person, building on the success of recent generative diffusion models.

πŸ‘₯ Authors: @enriccorona , @Andreiz , @kolotouros , @thiemoall , and et al.

πŸ”— Paper: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2403.08764)

🌐 Github Page: https://enriccorona.github.io/vlogger/

πŸ“š More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

πŸš€ Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

πŸ” Keywords: #VLOGGER #EmbodiedAvatarSynthesis #MultimodalDiffusion #GenerativeDiffusionModels #DeepLearning #Animation #Innovation
In this post