@DmitryRyumin on Hugging Face: "🚀🎭🌟 New Research Alert! 🌟🎭 🚀 📄 Title: VLOGGER: Multimodal Diffusion for…"

Post

🚀🎭🌟 New Research Alert! 🌟🎭 🚀
📄 Title: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis 🌟🚀

📝 Description: VLOGGER is a method for text- and audio-driven generation of talking human video from a single input image of a person, building on the success of recent generative diffusion models.

👥 Authors: @enriccorona , @Andreiz , @kolotouros , @thiemoall , and et al.

🔗 Paper: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2403.08764)

🌐 Github Page: https://enriccorona.github.io/vlogger/

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #VLOGGER #EmbodiedAvatarSynthesis #MultimodalDiffusion #GenerativeDiffusionModels #DeepLearning #Animation #Innovation