Post
πππ New Research Alert! ππ π
π Title: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis ππ
π Description: VLOGGER is a method for text- and audio-driven generation of talking human video from a single input image of a person, building on the success of recent generative diffusion models.
π₯ Authors: @enriccorona , @Andreiz , @kolotouros , @thiemoall , and et al.
π Paper: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2403.08764)
π Github Page: https://enriccorona.github.io/vlogger/
π More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
π Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36
π Keywords: #VLOGGER #EmbodiedAvatarSynthesis #MultimodalDiffusion #GenerativeDiffusionModels #DeepLearning #Animation #Innovation
π Title: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis ππ
π Description: VLOGGER is a method for text- and audio-driven generation of talking human video from a single input image of a person, building on the success of recent generative diffusion models.
π₯ Authors: @enriccorona , @Andreiz , @kolotouros , @thiemoall , and et al.
π Paper: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2403.08764)
π Github Page: https://enriccorona.github.io/vlogger/
π More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin
π Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36
π Keywords: #VLOGGER #EmbodiedAvatarSynthesis #MultimodalDiffusion #GenerativeDiffusionModels #DeepLearning #Animation #Innovation