Dmitry Ryumin

DmitryRyumin

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Organizations

DmitryRyumin's activity

posted an update 5 days ago
view post
Post
865
🚀👕🌟 New Research Alert - SIGGRAPH 2024 (Avatars Collection)! 🌟👚🚀
📄 Title: LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer 🔝

📝 Description: LayGA is a novel method for animatable clothing transfer that separates the body and clothing into two layers for improved photorealism and accurate clothing tracking, outperforming existing methods.

👥 Authors: Siyou Lin, Zhe Li, Zhaoqi Su, Zerong Zheng, Hongwen Zhang, and Yebin Liu

📅 Conference: SIGGRAPH, 28 Jul – 1 Aug, 2024 | Denver CO, USA 🇺🇸

📄 Paper: LayGA: Layered Gaussian Avatars for Animatable Clothing Transfer (2405.07319)

🌐 Github Page: https://jsnln.github.io/layga/index.html

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #LayGA #AnimatableClothingTransfer #VirtualTryOn #AvatarTechnology #SIGGRAPH2024 #ComputerGraphics #DeepLearning #ComputerVision #Innovation
replied to their post 7 days ago
posted an update 7 days ago
view post
Post
1159
🚀🎭🌟 New Research Alert - AniTalker (Avatars Collection)! 🌟🎭🚀
📄 Title: AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding 🔝

📝 Description: AniTalker is a new framework that transforms a single static portrait and a single input audio file into animated, talking videos with natural, fluid movements.

👥 Authors: Tao Liu, Feilong Chen, Shuai Fan, @cpdu , Qi Chen, Xie Chen, and Kai Yu

📄 Paper: AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding (2405.03121)

🌐 Github Page: https://x-lance.github.io/AniTalker
📁 Repository: https://github.com/X-LANCE/AniTalker

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #AniTalker #FacialAnimation #DynamicAvatars #FaceSynthesis #TalkingFaces #DiffusionModel #ComputerGraphics #DeepLearning #ComputerVision #Innovation
  • 2 replies
·
posted an update 9 days ago
view post
Post
1017
😀😲😐😡 New Research Alert - FER-YOLO-Mamba (Facial Expressions Recognition Collection)! 😡😥🥴😱
📄 Title: FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space 🔝

📝 Description: FER-YOLO-Mamba is a novel facial expression recognition model that combines the strengths of YOLO and Mamba technologies to efficiently recognize and localize facial expressions.

👥 Authors: Hui Ma, Sen Lei, Turgay Celik, and Heng-Chao Li

🔗 Paper: FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space (2405.01828)

📁 Repository: https://github.com/SwjtuMa/FER-YOLO-Mamba

🚀 Added to the Facial Expressions Recognition Collection: DmitryRyumin/facial-expressions-recognition-65f22574e0724601636ddaf7

🔥🔝 See also Facial_Expression_Recognition - ElenaRyumina/Facial_Expression_Recognition (App, co-authored by @DmitryRyumin ) 😉

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #FERYOLOMamba #FER #YOLO #Mamba #FacialExpressionRecognition #EmotionRecognition #ComputerVision #DeepLearning #MachineLearning #Innovation
posted an update 10 days ago
view post
Post
1668
🔥🚀🌟 New Research Alert - YOCO! 🌟🚀🔥
📄 Title: You Only Cache Once: Decoder-Decoder Architectures for Language Models 🔝

📝 Description: YOCO is a novel decoder-decoder architecture for LLMs that reduces memory requirements, speeds up prefilling, and maintains global attention. It consists of a self-decoder for encoding KV caches and a cross-decoder for reusing these caches via cross-attention.

👥 Authors: Yutao Sun et al.

📄 Paper: You Only Cache Once: Decoder-Decoder Architectures for Language Models (2405.05254)

📁 Repository: https://github.com/microsoft/unilm/tree/master/YOCO

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #YOCO #DecoderDecoder #LargeLanguageModels #EfficientArchitecture #GPUMemoryReduction #PrefillingSpeedup #GlobalAttention #DeepLearning #Innovation #AI
  • 2 replies
·
posted an update 11 days ago
view post
Post
1881
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory 🔝

📝 Description: xLSTM is a scaled-up LSTM architecture with exponential gating and modified memory structures to mitigate known limitations. xLSTM blocks outperform SOTA transformers and state-space models in performance and scaling.

Eagerly awaiting the code release! 🕒️

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #xLSTM #DeepLearning #Innovation #AI
  • 1 reply
·
posted an update 14 days ago
view post
Post
2495
🚀🎭🌟 New Research Alert - SIGGRAPH 2024 (Avatars Collection)! 🌟🎭🚀
📄 Title: 3D Gaussian Blendshapes for Head Avatar Animation 🔝

📝 Description: 3D Gaussian Blendshapes for Head Avatar Animation is a novel method for modeling and animating photorealistic head avatars from monocular video input.

👥 Authors: Shengjie Ma, Yanlin Weng, Tianjia Shao, and Kun Zhou

📅 Conference: SIGGRAPH, 28 Jul – 1 Aug, 2024 | Denver CO, USA 🇺🇸

📄 Paper: 3D Gaussian Blendshapes for Head Avatar Animation (2404.19398)

🌐 Github Page: https://gapszju.github.io/GaussianBlendshape/

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #3DAnimation #HeadAvatar #GaussianBlendshapes #FacialAnimation #RealTimeRendering #SIGGRAPH2024 #ComputerGraphics #DeepLearning #ComputerVision #Innovation
replied to their post 18 days ago
view reply

The authors plan to release the Dataset itself in June and the code in July, apparently during CVPR.

posted an update 19 days ago
view post
Post
1316
🚀🎭🌟 New Research Alert - CVPR 2024 (Avatars Collection)! 🌟🎭🚀
📄 Title: EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars 🔝

📝 Description: EMOPortraits is an enhanced multimodal one-shot head avatar model that achieves SOTA performance in emotion transfer and audio-driven facial animation tasks by improving the training pipeline and architecture to better handle intense and asymmetric facial expressions, while also proposing a novel multiview video dataset containing a wide range of such expressions.

👥 Authors: Nikita Drobyshev, Antoni Bigata Casademunt, Konstantinos Vougioukas, Zoe Landgraf, Stavros Petridis, and Maja Pantic

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

📄 Paper: EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars (2404.19110)

🌐 GitHub Page: https://neeek2303.github.io/EMOPortraits

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #EMOPortraits #EmotionalTransfer #FacialAnimation #HeadAvatar #MultimodalLearning #OneShotLearning #AsymmetricFacialExpressions #IntenseFacialExpressions #NovelDataset #CVPR2024 #DeepLearning #ComputerVision #Innovation
  • 3 replies
·
posted an update 23 days ago
view post
Post
1472
🚀🎭🔥 New Research Alert (Avatars Collection)! 🔥🎭🚀
📄 Title: ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving 🔝

📝 Description: ConsistentID is a novel portrait generation method that preserves the fine-grained identity of a single reference image.

👥 Authors: Jiehui Huang et al.

🔗 Paper: ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving (2404.16771)

🌐 Github Page: https://ssugarwh.github.io/consistentid.github.io/
📁 Repository: https://github.com/JackAILab/ConsistentID

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #ConsistentID #PortraitGeneration #IdentityPreservation #FineGrainedControl #ImageSynthesis #GenerativeModels #ComputerVision #DeepLearning
posted an update 24 days ago
view post
Post
2146
🚀🕺🌟 New Research Alert - CVPR 2024 (Avatars Collection)! 🌟💃🚀
📄 Title: WANDR: Intention-guided Human Motion Generation 🔝

📝 Description: WANDR is a conditional Variational AutoEncoder (c-VAE) that generates realistic motion of human avatars that navigate towards an arbitrary goal location and reach for it.

👥 Authors: Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

📄 Paper: WANDR: Intention-guided Human Motion Generation (2404.15383)

🌐 Web Page: https://wandr.is.tue.mpg.de
📁 Repository: https://github.com/markos-diomataris/wandr

📺 Video: https://www.youtube.com/watch?v=9szizM-XUCg

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #WANDR #HumanMotionGeneration #MotionSynthesis #3DAvatar #GoalOrientedMovement #IntentionGuided #ConditionalVAE #CVPR2024 #DeepLearning #Innovation
posted an update 26 days ago
view post
Post
1900
🚀🎭🔥 New Research Alert (Avatars Collection)! 🔥👄🚀
📄 Title: Learn2Talk: 3D Talking Face Learns from 2D Talking Face

📝 Description: Learn2Talk is a framework that leverages expertise from 2D talking face methods to improve 3D talking face synthesis, focusing on lip synchronization and speech perception.

👥 Authors: Yixiang Zhuang et al.

🔗 Paper: Learn2Talk: 3D Talking Face Learns from 2D Talking Face (2404.12888)

🌐 Github Page: https://lkjkjoiuiu.github.io/Learn2Talk/

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #Learn2Talk #3DTalkingFace #SpeechDrivenFacialAnimation #LipSync #SpeechPerception #ComputerVision #ImageProcessing #DeepLearning
posted an update 28 days ago
view post
Post
1773
😀🤓😎 New Research Alert - NAACL 2024 (Big Five Personality Traits Collection)! 😎😉😤
📄 Title: PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits 💬

📝 Description: This research examines the ability of LLMs to express personality traits and finds that LLMs can generate content consistent with assigned personality profiles and that humans can recognize certain traits with up to 80% accuracy. However, accuracy drops significantly when annotators are aware that the content was generated by an AI.

👥 Authors: Hang Jiang et al.

📅 Conference: NAACL, June 16–21, 2024 | Mexico City, Mexico 🇲🇽

🔗 Paper: PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits (2305.02547)

📁 Repository: https://github.com/hjian42/PersonaLLM

🚀 Added to the Big Five Personality Traits Collection: DmitryRyumin/big-five-personality-traits-661fb545292ab3d12a5a4890

🔥🔝 See also OCEAN-AI - ElenaRyumina/OCEANAI (App, co-authored by @DmitryRyumin ) 😉

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #PersonaLLM #OCEANAI #BigFive #PersonalityTraits #PersonalityAnalysis #Chatbots #LLMs #NAACL2024 #DeepLearning #Innovation
posted an update 29 days ago
view post
Post
3046
🚀💇‍♂️🔥 New Research Alert (Avatars Collection)! 🔥💇‍♀️🚀
📄 Title: HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach

📝 Description: HairFastGAN is a fast, encoder-based approach to realistic and robust hair transfer that operates in the FS latent space of StyleGAN and includes enhanced in-painting and improved encoders for better alignment, color transfer, and post-processing.

👥 Authors: Maxim Nikolaev, Mikhail Kuznetsov, Dmitry Vetrov, and Aibek Alanov

🔗 Paper: HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach (2404.01094)

📁 Repository: https://github.com/AIRI-Institute/HairFastGAN

🤗 Demo: multimodalart/hairfastgan
🔥 Model 🤖: AIRI-Institute/HairFastGAN

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #HairFastGAN #StyleGAN #VirtualTryOn #HairTransfer #AIHairStyling #GenerativeModels #ComputerVision #ImageProcessing #DeepLearning
posted an update about 1 month ago
view post
Post
2286
🚀👩‍🎤🌟 New Research Alert - CVPR 2024! 🌟👩‍🎤🚀
📄 Title: Generalizable Face Landmarking Guided by Conditional Face Warping

📝 Description: A new method is proposed to learn a generalizable face landmark that can handle different facial styles, using labeled real faces and unlabeled stylized faces.

👥 Authors: Jiayi Liang, Haotian Liu, Hongteng Xu, Dixin Luo

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: Generalizable Face Landmarking Guided by Conditional Face Warping (2404.12322)

🌐 Github Page: https://plustwo0.github.io/project-face-landmarker/
📁 Repository: https://github.com/plustwo0/generalized-face-landmarker

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #FaceLandmarking #DomainAdaptation #FaceWarpping #CVPR2024 #DeepLearning #Innovation
replied to Jaward's post about 1 month ago
view reply

Thank you so much! I'm planning a series of posts on Personality Traits and the Big Five. All posts will be related to the collection you mentioned. Also, together with @ElenaRyumina , we are planning to expand the App (OCEANAI) and are preparing a publication. I hope to be able to share in the future after the accepted paper to the A-Level Conference.

replied to Jaward's post about 1 month ago
posted an update about 1 month ago
view post
Post
2923
😀🤓😎 New Research Alert - LREC-COLING 2024 (Big Five Personality Traits Collection)! 😎😉😤
📄 Title: PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language Models

📝 Description: The PSYDIAL presents a novel pipeline for generating personality-based synthetic dialog data to elicit more human-like responses from language models, and presents a Korean dialog dataset focused on personality-based dialog.

👥 Authors: Ji-Eun Han et al.

📅 Conference: LREC-COLING, May 20-25, 2024 | Torino, Italia 🇮🇹

🔗 Paper: PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language Models (2404.00930)

🚀 Added to the Big Five Personality Traits Collection: DmitryRyumin/big-five-personality-traits-661fb545292ab3d12a5a4890

🔥🔝 See also OCEAN-AI - ElenaRyumina/OCEANAI (App, co-authored by @DmitryRyumin ) 😉

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #PSYDIAL #PersonalityDialogues #SyntheticData #LanguageModels #ConversationalAI #KoreanDialogues #BigFivePersonality #ExtraveresionDialogues #OCEANAI #BigFive #PersonalityTraits #PersonalityAnalysis #LREC-COLING2024 #DeepLearning #Innovation
posted an update about 1 month ago
view post
Post
2443
😀🤓😎 New Space - OCEAN-AI (App, co-authored by @DmitryRyumin ) 😎😉😤
🚀 Title: OCEAN-AI is an open-source app for Big Five personality traits assessment and HR-processes automatization.

🤗 Demo: ElenaRyumina/OCEANAI

👥 Authors: @ElenaRyumina , @DmitryRyumin , and Alexey Karpov

📝 Description: OCEAN-AI consists of a set of modules for intellectual analysis of human behavior based on multimodal data for automatic personality traits (PT) assessment. The app evaluates five PT: Openness to experience, Conscientiousness, Extraversion, Agreeableness, Non-Neuroticism.

The App solves practical tasks:
- Ranking of potential candidates by professional responsibilities.
- Forming effective work teams.
- Predicting consumer preferences for industrial goods.

🔍 Keywords: #OCEANAI #BigFive #PersonalityTraits #PersonalityAnalysis #MultimodalData #Transformers #FirstImpressionsV2 #DeepLearning #Innovation #BehaviorAnalysis #AffectiveRecognition #TeamFormation #ConsumerPreferences #CandidateRanking
  • 1 reply
·
posted an update about 1 month ago
view post
Post
2290
🕺🎬🔥 New Research Alert - CVPR 2024 (Avatars Collection)! 🔥🤖⚡
📄 Title: GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh

📝 Description: GoMAvatar is an efficient method for real-time, high-quality, animatable human modeling from a single monocular video. It combines the rendering quality of Gaussian splatting with the geometry modeling capabilities of deformable meshes, enabling realistic digital avatars that can be rearticulated in new poses and rendered from novel angles, while seamlessly integrating with graphics pipelines.

👥 Authors: Jing Wen, Xiaoming Zhao, Zhongzheng Ren, Alexander G. Schwing, Shenlong Wang

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh (2404.07991)

🌐 Github Page: https://wenj.github.io/GoMAvatar/
📁 Repository: https://github.com/wenj/GoMAvatar

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #GoMAvatar #3DAvatar #3DAnimation #AnimatableAvatars #MonocularVideo #RealTimeRendering #HumanModeling #CVPR2024 #DeepLearning #Innovation
posted an update about 1 month ago
view post
Post
2468
🚀🕺🌟 New Research Alert (Avatars Collection)! 🌟💃🚀
📄 Title: PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations 🔝

📝 Description: PhysAvatar is a novel framework that uses inverse rendering and physics to autonomously reconstruct the shape, appearance, and physical properties of clothed human avatars from multi-view video data.

👥 Authors: Yang Zheng et al.

🔗 Paper: PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations (2404.04421)

🌐 GitHub Page: https://qingqing-zhao.github.io/PhysAvatar

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #PhysAvatar #DigitalHumans #InverseRendering #PhysicsSimulation #AvatarModeling #ClothSimulation #PhotorealisticRendering #ComputerVision #DeepLearning #Innovation
posted an update about 1 month ago
view post
Post
2239
🚀💃🌟 New Research Alert (Avatars Collection)! 🌟🕺🚀
📄 Title: InstructHumans: Editing Animated 3D Human Textures with Instructions

📝 Description: InstructHumans is a novel framework for text-instructed editing of 3D human textures that employs a modified Score Distillation Sampling (SDS-E) method along with spatial smoothness regularization and gradient-based viewpoint sampling to achieve high-quality, consistent, and instruction-true edits.

👥 Authors: Jiayin Zhu, Linlin Yang, Angela Yao

🔗 Paper: InstructHumans: Editing Animated 3D Human Textures with Instructions (2404.04037)

🌐 Web Page: https://jyzhu.top/instruct-humans
📁 Repository: https://github.com/viridityzhu/InstructHumans

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #InstructHumans #3DTextureEditing #TextInstructions #ScoreDistillationSampling #SDS-E #SpatialSmoothnessRegularization #3DEditing #AvatarEditing #DeepLearning #Innovation
posted an update about 1 month ago
view post
Post
2531
🚀🕺🌟 New Research Alert - CVPR 2024 (Avatars Collection)! 🌟💃🚀
📄 Title: 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting 🔝

📝 Description: 3DGS-Avatar is a novel method for creating animatable human avatars from monocular videos using 3D Gaussian Splatting (3DGS). By using a non-rigid deformation network and as-isometric-as-possible regularizations, the method achieves comparable or better performance than SOTA methods while being 400x faster in training and 250x faster in inference, allowing real-time rendering at 50+ FPS.

👥 Authors: Zhiyin Qian, Shaofei Wang, Marko Mihajlovic, Andreas Geiger, Siyu Tang

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting (2312.09228)

🌐 Github Page: https://neuralbodies.github.io/3DGS-Avatar/
📁 Repository: https://github.com/mikeqzy/3dgs-avatar-release

📺 Video: https://www.youtube.com/watch?v=FJ29U9OkmmU

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #3DGSAvatar #3DAvatar #3DGaussianSplatting #AnimatableAvatars #MonocularVideo #RealTimeRendering #FastTraining #EfficientInference #CVPR2024 #DeepLearning #Innovation
posted an update about 2 months ago
view post
Post
2268
🚀🎭🌟 New Research Alert - CVPR 2024 (Avatars Collection)! 🌟 🎭🚀
📄 Title: GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image 🔝

📝 Description: GeneAvatar is a generic approach for editing 3D head avatars based on a single 2D image, applicable to different volumetric representations. The novel expression-aware generative modification model delivers high quality and consistent editing results across multiple viewpoints and emotions.

👥 Authors: Chong Bao et al.

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image (2404.02152)

🌐 Github Page: https://zju3dv.github.io/geneavatar/
📁 Repository: https://github.com/zju3dv/GeneAvatar

📺 Video: https://www.youtube.com/watch?v=4zfbfPivtVU

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #GeneAvatar #HeadAvatar #3DHeadAvatarEditing #VolumetricHeadAvatar #SingleImageEditing #ExpressionAwareModification #CVPR2024 #DeepLearning #Innovation
posted an update about 2 months ago
view post
Post
1785
🚀🎭🌟 New Research Alert - CVPR 2024 (Avatars Collection)! 🌟 🎭🚀
📄 Title: MonoAvatar++: Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes 🔝

📝 Description: MonoAvatar++ is a real-time neural implicit 3D head avatar model with high quality and fine-grained control over facial expressions. It uses local hash table blendshapes attached to a parametric facial model for efficient rendering, achieving SOTA results even for challenging expressions.

👥 Authors: Ziqian Bai, Feitong Tan, Sean Fanello, Rohit Pandey, Mingsong Dou, Shichen Liu, Ping Tan, Yinda Zhang

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes (2404.01543)

🌐 Github Page: https://augmentedperception.github.io/monoavatar-plus

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #MonoAvatar++ #HeadAvatar #3DModeling #AvatarGeneration #NeuralImplicitAvatar #EfficientRendering #CVPR2024 #DeepLearning #Innovation
replied to their post about 2 months ago
view reply

Hi @researcher171473 ,

The idea of using GANs or latent diffusion models to augment visual data instead of image mixing is indeed interesting. However, I have a few considerations:

  1. Training GANs and diffusion models is typically more resource intensive than simple image mixing.
  2. Ensuring that the generated examples are sufficiently informative and diverse to improve the classifier may require additional mechanisms (diversity regularization, adversarial training, latent space manipulation, domain-specific constraints, etc.).
  3. The generated examples must retain their original semantics and class membership to effectively complement the training data.
  4. The classifier may overfit the generated examples and lose performance on real data.

Despite these potential challenges, combining image blending with generative models could potentially yield better results. For example, GANs could be used to generate additional realistic samples that can then be mixed to increase diversity.

posted an update about 2 months ago
view post
Post
1411
🎯🖼️🌟 New Research Alert - ICLR 2024! 🌟 🖼️🎯
📄 Title: Adversarial AutoMixup 🖼️

📝 Description: Adversarial AutoMixup is an approach to image classification augmentation. By alternately optimizing a classifier and a mixed-sample generator, it attempts to generate challenging samples and improve the robustness of the classifier against overfitting.

👥 Authors: Huafeng Qin et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: Adversarial AutoMixup (2312.11954)

📁 Repository: https://github.com/JinXins/Adversarial-AutoMixup

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #AutoMixup #ImageClassification #ImageAugmentation #AdversarialLearning #ICLR2024 #DeepLearning #Innovation
·
posted an update about 2 months ago
view post
Post
2131
☁️☔ New Research Alert! ❄️🌙
📄 Title: CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning

📝 Description: CoDA is a UDA methodology that boosts models to understand all adverse scenes (☁️,☔,❄️,🌙) by highlighting the discrepancies within these scenes. CoDA achieves state-of-the-art performances on widely used benchmarks.

👥 Authors: Ziyang Gong, Fuhao Li, Yupeng Deng, Deblina Bhattacharjee, Xiangwei Zhu, Zhenming Ji

🔗 Paper: CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning (2403.17369)

📁 Repository: https://github.com/Cuzyoung/CoDA

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #CoDA #DomainAdaptation #VisualPromptTuning #SAVPT #DeepLearning #Innovation
posted an update about 2 months ago
view post
Post
1731
🚀🎭🌟 New Research Alert! 🌟 🎭🚀
📄 Title: AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation 🔝

📝 Description: AniPortrait is a novel framework for generating photorealistic portrait animations driven by audio and a reference image, with superior facial naturalness, pose variety, and visual quality, with potential applications in facial motion editing and facial reenactment.

👥 Authors: Huawei Wei, @ZJYang , Zhisheng Wang

🔗 Paper: AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation (2403.17694)

📁 Repository: https://github.com/Zejun-Yang/AniPortrait

🤗 Demo: ZJYang/AniPortrait_official
🔥 Model 🤖: ZJYang/AniPortrait

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #AniPortrait #Animation #AudioDriven #Photorealistic #FacialAnimation #DeepLearning #Innovation
  • 2 replies
·
posted an update about 2 months ago
view post
Post
1580
🚀🎭🌟 New Research Alert! 🌟 🎭🚀
📄 Title: FlashFace: Human Image Personalization with High-fidelity Identity Preservation 🔝

📝 Description: FlashFace is a personalized photo editing tool that focuses on high-fidelity identity preservation and improved compliance through advanced encoding and integration strategies.

👥 Authors: Shilong Zhang, Lianghua Huang, @xichenhku et al.

🔗 Paper: FlashFace: Human Image Personalization with High-fidelity Identity Preservation (2403.17008)

🌐 Github Page: https://jshilong.github.io/flashface-page

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #FlashFace #Personalization #HighFidelityIdentity #DeepLearning #Innovation
posted an update about 2 months ago
view post
Post
2658
🚀💃🌟 New Research Alert - ICASSP 2024! 🌟 🕺🚀
📄 Title: Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute 🌟

📝 Description: Text2Avatar is a novel approach that can generate realistic 3D human avatars directly from textual descriptions, enabling multi-attribute control and realistic styling, overcoming the challenges of feature coupling and data scarcity in this domain.

👥 Authors: Chaoqun Gong et al.

📅 Conference: ICASSP, 14-19 April 2024 | Seoul, Korea 🇰🇷

🔗 Paper: Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute (2401.00711)

🌐 Github Page: https://iecqgong.github.io/text2avatar/

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

📁 Added to the ICASSP-2023-24-Papers: https://github.com/DmitryRyumin/ICASSP-2023-24-Papers

🔍 Keywords: #AvatarGeneration #Text2Avatar #ICASSP2024 #DeepLearning #Innovation
  • 1 reply
·
posted an update about 2 months ago
view post
Post
1815
🚀🎭🌟 New Research Alert - CVPR 2024! 🌟 🎭🚀
📄 Title: GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians 🔝

📝 Description: GaussianAvatars proposes a novel method for creating photorealistic and fully controllable head avatars by combining a parametric morphable face model with a dynamic 3D representation based on rigged 3D Gaussian splats, enabling high-quality rendering and precise animation control.

👥 Authors: Shenhan Qian, Tobias Kirschstein, Liam Schoneveld, Davide Davoli, Simon Giebenhain, Matthias Nießner

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians (2312.02069)

🌐 Github Page: https://shenhanqian.github.io/gaussian-avatars
📁 Repository: https://github.com/ShenhanQian/GaussianAvatars

📺 Video: https://www.youtube.com/watch?v=lVEY78RwU_I

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #HeadAvatar #GaussianAvatars #DynamicGaussians #3DModeling #AvatarGeneration #CVPR2024 #DeepLearning #Innovation
posted an update about 2 months ago
view post
Post
1362
🚀🎭🌟 New Research Alert - CVPR 2024! 🌟 🎭🚀
📄 Title: Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians 🔝

📝 Description: Gaussian Head Avatar is a method for generating highly detailed 3D head avatars using dynamic Gaussian functions controlled by a neural network, ensuring ultra-high quality visualization even under limited viewpoints.

👥 Authors: Yuelang Xu, @ben55 , Zhe Li, @HongwenZhang , @wanglz14 , Zerong Zheng, and @YebinLiu

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians (2312.03029)

🌐 Github Page: https://yuelangx.github.io/gaussianheadavatar
📁 Repository: https://github.com/YuelangX/Gaussian-Head-Avatar

📺 Video: https://www.youtube.com/watch?v=kvrrI3EoM5g

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #HeadAvatar #DynamicGaussians #3DModeling #AvatarGeneration #CVPR2024 #DeepLearning #Innovation
posted an update 2 months ago
view post
Post
1853
🚀🎭🌟 New Research Alert - ICLR 2024! 🌟 🎭🚀
📄 Title: InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image 🌟🚀

📝 Description: InstructPix2NeRF is a novel approach to instructed 3D portrait editing from a single image, using a conditional latent 3D diffusion process and a token position randomization strategy to enable multi-semantic editing while preserving the identity of the portrait.

👥 Authors: Jianhui Li et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image (2311.02826)

🌐 Github Page: https://mybabyyh.github.io/InstructPix2NeRF
📁 Repository: https://github.com/mybabyyh/InstructPix2NeRF

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #InstructPix2NeRF #AvatarCustomization #3DPortrait #DiffusionProcess #IdentityConsistency #ICLR2024 #DeepLearning #Innovation
posted an update 2 months ago
view post
Post
1919
🚀🕺🌟 New Research Alert - CVPR 2024! 🌟 💃🏻🚀
📄 Title: NECA: Neural Customizable Human Avatar 🌟🚀

📝 Description: The NECA paper presents a novel method for creating customizable human avatars from video, allowing detailed manipulation of pose, shadow, shape, lighting, and texture for realistic rendering and editing.

👥 Authors: Junjin Xiao, Qing Zhang, Zhan Xu, and Wei-Shi Zheng

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: NECA: Neural Customizable Human Avatar (2403.10335)

📁 Repository: https://github.com/iSEE-Laboratory/NECA

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #NECA #AvatarCustomization #RealisticRendering #HumanRepresentation #CVPR2024 #DeepLearning #Animation #Innovation
posted an update 2 months ago
view post
Post
🚀💃🏻🌟 New Research Alert - CVPR 2024! 🌟🕺 🚀
📄 Title: Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling 🌟🚀

📝 Description: Animatable Gaussians - a novel method for creating lifelike human avatars from RGB videos, utilizing 2D CNNs and 3D Gaussian splatting to capture pose-dependent garment details and dynamic appearances with high fidelity.

👥 Authors: Zhe Li, Zerong Zheng, Lizhen Wang, and Yebin Liu

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling (2311.16096)

🌐 Github Page: https://animatable-gaussians.github.io
📁 Repository: https://github.com/lizhe00/AnimatableGaussians

📺 Video: https://www.youtube.com/watch?v=kOmZxD0HxZI

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #AnimatableGaussians #HumanAvatars #3DGaussianSplatting #CVPR2024 #DeepLearning #Animation #Innovation
posted an update 2 months ago
view post
Post
🚀🎭🌟 New Research Alert! 🌟🎭 🚀
📄 Title: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis 🌟🚀

📝 Description: VLOGGER is a method for text- and audio-driven generation of talking human video from a single input image of a person, building on the success of recent generative diffusion models.

👥 Authors: @enriccorona , @Andreiz , @kolotouros , @thiemoall , and et al.

🔗 Paper: VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis (2403.08764)

🌐 Github Page: https://enriccorona.github.io/vlogger/

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #VLOGGER #EmbodiedAvatarSynthesis #MultimodalDiffusion #GenerativeDiffusionModels #DeepLearning #Animation #Innovation
posted an update 2 months ago
view post
Post
🚀🗣️🌟 New Research Alert - ICASSP 2024! 🌟🗣️🚀
📄 Title: AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement 🌟🚀

📝 Description: Diffused Resynthesis and HuBERT Speech Quality Enhancement.

👥 Authors: Ju-Chieh Chou, Chung-Ming Chien, Karen Livescu

📅 Conference: ICASSP, 14-19 April 2024 | Seoul, Korea 🇰🇷

🔗 Paper: AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement (2309.08030)

🌐 Web Page: https://home.ttic.edu/~jcchou/demo/avse/avse_demo.html

📚 More Papers: more cutting-edge research presented at other conferences in the
DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Speech Enhancement Collection: DmitryRyumin/speech-enhancement-65de31e1b6d9a040c151702e

🔍 Keywords: #AV2Wav #SpeechEnhancement #SpeechProcessing #AudioVisual #Diffusion #ICASSP2024 #Innovation
posted an update 2 months ago
view post
Post
🚀🕺🌟 New Research Alert - AAAI 2024! 🌟💃🚀
📄 Title: Relightable and Animatable Neural Avatars from Videos 🌟🚀

📝 Description: Relightable & animatable neural avatars from sparse videos.

👥 Authors: Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, and Feng Xu

📅 Conference: AAAI, February 20-27, 2024 | Vancouver, Canada 🇨🇦

🔗 Paper: Relightable and Animatable Neural Avatars from Videos (2312.12877)

🌐 Github Page: https://wenbin-lin.github.io/RelightableAvatar-page
📁 Repository: https://github.com/wenbin-lin/RelightableAvatar

📺 Video: https://www.youtube.com/watch?v=v9rlys0xQGo

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

📚 Added to the AAAI 2024 Papers: https://github.com/DmitryRyumin/AAAI-2024-Papers

🔍 Keywords: #NeuralAvatar #RelightableAvatars #AnimatableAvatars #3DModeling #PhotorealisticRendering #ShadowModeling #DigitalAvatars #GeometryModeling #AAAI2024 #DeepLearning #Animation #Innovation
posted an update 2 months ago
view post
Post
🚀🖼️🌟 New Research Alert - CVPR 2024! 🌟🖼️🚀
📄 Title: CAMixerSR: Only Details Need More "Attention" 🌟🚀

📝 Description: CAMixerSR is a new approach integrating content-aware accelerating framework and token mixer design, to pursue more efficient SR inference via assigning convolution for simple regions but window-attention for complex textures. It exhibits excellent generality and attains competitive results among state-of-the-art models with better complexity-performance trade-offs on large-image SR, lightweight SR, and omnidirectional-image SR.

👥 Authors: Yan Wang, Shijie Zhao, Yi Liu, Junlin Li, and Li Zhang

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: CAMixerSR: Only Details Need More "Attention" (2402.19289)

🔗 Repository: https://github.com/icandle/CAMixerSR

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Image Enhancement Collection: DmitryRyumin/image-enhancement-65ee1cd2fe1c0c877ae55d28

🔍 Keywords: #CAMixerSR #SuperResolution #WindowAttention #ImageEnhancement #CVPR2024 #DeepLearning #Innovation
posted an update 2 months ago
view post
Post
🚀🎭🌟 New Research Alert - ICLR 2024! 🌟🎭 🚀
📄 Title: GPAvatar: Generalizable and Precise Head Avatar from Image(s) 🌟🚀

📝 Description: GPAvatar's objective is to faithfully replicate head avatars while providing precise control over expressions and postures.

👥 Authors: Xuangeng Chu et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: GPAvatar: Generalizable and Precise Head Avatar from Image(s) (2401.10215)

🔗 Github Page: https://xg-chu.github.io/project_gpavatar
🔗 Repository: https://github.com/xg-chu/GPAvatar

🔗 Video: https://www.youtube.com/watch?v=7A3DMaB6Zk0

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #GPAvatar #MTA #Synthesis #LipSyncing #Expressions #HighResolutionVideos #ICLR2024 #DeepLearning #Animation #Innovation
  • 1 reply
·
posted an update 2 months ago
view post
Post
🚀🎭🌟 New Research Alert - ICLR 2024! 🌟🎭 🚀
📄 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis 🌟🚀

👥 Authors: Zhenhui Ye et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis (2401.08503)

🔗 Github Page: https://real3dportrait.github.io/
🔗 Repository: https://github.com/yerfor/Real3DPortrait

🔥 Model 🤖: ameerazam08/Real3DPortrait

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #Real3D-Potrait #I2P #HTB-SR #A2M #Synthesis #LipSyncing #HighResolutionVideos #ICLR2024 #DeepLearning #Animation #Innovation
posted an update 2 months ago
view post
Post
🚀😈🌟 New Research Alert - CVPR 2024! 🌟😈 🚀
📄 Title: SyncTalk: The Devil 😈 is in the Synchronization for Talking Head Synthesis 🌟🚀

📝 Description: SyncTalk synthesizes synchronized talking head videos, employing tri-plane hash representations to maintain subject identity. It can generate synchronized lip movements, facial expressions, and stable head poses, and restores hair details to create high-resolution videos.

👥 Authors: Ziqiao Peng et al.

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis (2311.17590)

🔗 Github Page: https://ziqiaopeng.github.io/synctalk
🔗 Repository: https://github.com/ZiqiaoPeng/SyncTalk

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #TalkingHeads #Synthesis #TriPlaneHash #FacialExpressions #LipSyncing #HighResolutionVideos #CVPR2024 #DeepLearning #Animation #Innovation
posted an update 3 months ago
view post
Post
🚀🎬🌟 New Research Alert - CVPR 2024! 🌟🎬 🚀
📄 Title: GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians 🌟🚀

👥 Authors: Liangxiao Hu et al.

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🔗 Paper: GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians (2312.02134)

🔗 Github Page: https://huliangxiao.github.io/GaussianAvatar
🔗 Repository: https://github.com/huliangxiao/GaussianAvatar

🔗 Video: https://www.youtube.com/watch?v=a4g8Z9nCF-k

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #GaussianAvatar #3DGaussians #HumanAvatarModeling #PoseDependentAppearance #DynamicAppearanceModeling #MotionEstimation #MonocularSettings #AppearanceQuality #RenderingEfficiency #CVPR2024 #DeepLearning #Animation #Innovation
posted an update 3 months ago
view post
Post
🚀💃🌟 New Research Alert - CVPR 2024! 🌟🕺 🚀
📄 Title: MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model 🌟🚀

👥 Authors: @junhao910323 , @hansyan et al.

📅 Conference: CVPR, Jun 17-21, 2024 | Seattle WA, USA 🇺🇸

🤗 Demo: zcxu-eric/magicanimate

🔗 Paper: MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model (2311.16498)
🔗 Github Page: https://showlab.github.io/magicanimate/
🔗 Repository: https://github.com/magic-research/magic-animate

🔥 Model 🤖: zcxu-eric/MagicAnimate

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36

🔍 Keywords: #MagicAnimate #DiffusionModel #HumanImageAnimation #CVPR2024 #Diffusion #DeepLearning #Innovation
posted an update 3 months ago
view post
Post
🌟🎭✨ Exciting News! The Latest in Expressive Video Portrait Generation! 🌟🎭✨

📄 Title: EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

👥 Authors: Linrui Tian, @lucaskingjade , Bang Zhang, and @Liefeng

🔗 Paper: EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions (2402.17485)
🔗 Github Page: https://humanaigc.github.io/emote-portrait-alive
🔗 Repository: https://github.com/HumanAIGC/EMO

🔍 Keywords: #EMO #EmotePortrait #Audio2VideoDiffusion #ExpressiveAnimations #VideoGeneration #DigitalArt #HumanExpression #ComputerVision #DeepLearning #AI

🚀 Added to the Avatars Collection: DmitryRyumin/avatars-65df37cdf81fec13d4dbac36
posted an update 3 months ago
view post
Post
🚀🔥🌟 New Research Alert - ICLR 2024! 🌟🔥🚀
📄 Title: FuseChat: Revolutionizing Chat Models Fusion 🌟🚀

👥 Authors: @Wanfq , @passerqxj et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: FuseChat: Knowledge Fusion of Chat Models (2402.16107)
🔗 Repository: https://github.com/fanqiwan/FuseLLM

🔥 Models 🤖:
1️⃣ FuseChat-7B-VaRM: FuseAI/FuseChat-7B-VaRM
2️⃣ FuseChat-7B-Slerp: FuseAI/FuseChat-7B-Slerp
3️⃣ OpenChat-3.5-7B-Solar: FuseAI/OpenChat-3.5-7B-Solar
4️⃣ FuseChat-7B-TA: FuseAI/FuseChat-7B-TA
5️⃣ OpenChat-3.5-7B-Mixtral: FuseAI/OpenChat-3.5-7B-Mixtral

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #FuseChat #ChatModels #KnowledgeFusion #ICLR2024 #AI #Innovation #FuseLLM
replied to their post 3 months ago
view reply

Got it, added links to models on the HF hub. Will keep that in mind for the future. 😊

posted an update 3 months ago
view post
Post
🌟✨ Exciting Announcement: NVIDIA AI Foundation Models ✨🌟

🚀 Interact effortlessly with the latest SOTA AI model APIs, all optimized on the powerful NVIDIA accelerated computing stack-right from your browser! 💻⚡

🔗 Web Page: https://catalog.ngc.nvidia.com/ai-foundation-models

🌟🎯 Favorites:

🔹 Code Generation:
1️⃣ Code Llama 70B 📝🔥: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/codellama-70b
Model 🤖: codellama/CodeLlama-70b-hf

🔹 Text and Code Generation:
1️⃣ Gemma 7B 💬💻: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/gemma-7b
Model 🤖: google/gemma-7b
2️⃣ Yi-34B 📚💡: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/yi-34b
Model 🤖: 01-ai/Yi-34B

🔹 Text Generation:
1️⃣ Mamba-Chat 💬🐍: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/mamba-chat
Model 🤖: havenhq/mamba-chat
2️⃣ Llama 2 70B 📝🦙: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/llama2-70b
Model 🤖: meta-llama/Llama-2-70b

🔹 Text-To-Text Translation:
1️⃣ SeamlessM4T V2 🌐🔄: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/seamless-m4t2-t2tt
Model 🤖: facebook/seamless-m4t-v2-large

🔹 Image Generation:
1️⃣ Stable Diffusion XL 🎨🔍: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/sdxl

🔹 Image Conversation:
1️⃣ NeVA-22B 🗨️📸: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/neva-22b

🔹 Image Classification and Object Detection:
1️⃣ CLIP 🖼️🔍: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/clip

🔹 Voice Conversion:
1️⃣ Maxine Voice Font 🗣️🎶: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/voice-font

🔹 Multimodal LLM (MLLM):
1️⃣ Kosmos-2 🌐👁️: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-foundation/models/kosmos-2
  • 2 replies
·
posted an update 3 months ago
view post
Post
🎉✨ Exciting Research Alert! YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information 🚀

YOLOv9 is the latest breakthrough in object detection!

📄 Title: YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

👥 Authors: Chien-Yao Wang et al.
📅 Published: ArXiv, February 2024

🔗 Paper: YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information (2402.13616)
🔗 Model 🤖: adonaivera/yolov9
🔗 Repo: https://github.com/WongKinYiu/yolov9

🚀 Don't miss out on this cutting-edge research! Explore YOLOv9 today and stay ahead of the curve in the dynamic world of computer vision. 🌟

🔍 Keywords: #YOLOv9 #ObjectDetection #DeepLearning #ComputerVision #Innovation #Research #ArtificialIntelligence
  • 1 reply
·
posted an update 3 months ago
view post
Post
🚀🔥🌟 New Research Alert - ICLR 2024! 🌟🔥🚀
📄 Title: FasterViT: Fast Vision Transformers with Hierarchical Attention

👥 Authors: @ahatamiz , @slivorezzz et al.

📅 Conference: ICLR, May 7-11, 2024 | Vienna, Austria 🇦🇹

🔗 Paper: FasterViT: Fast Vision Transformers with Hierarchical Attention (2306.06189)

🔗 Model 🤖 : nvidia/FasterViT
🔗 Repo: https://github.com/NVlabs/FasterViT

📚 More Papers: more cutting-edge research presented at other conferences in the DmitryRyumin/NewEraAI-Papers curated by @DmitryRyumin

🔍 Keywords: #VisionTransformers #DeepLearning #ComputerVision #ICLR2024 #MachineLearning #HierarchicalAttention #NeuralNetworks #Research #ArtificialIntelligence #Innovation
posted an update 3 months ago
view post
Post
📢 New Research Alert - AAAI 2024!
📄 Title: CARAT: Contrastive Feature Reconstruction and Aggregation for Multi-Modal Multi-Label Emotion Recognition

👥 Authors: Cheng Peng et al.

📅 Conference: AAAI, February 20-27, 2024 | Vancouver, Canada 🇨🇦

🔗 Paper: https://arxiv.org/abs/2312.10201

🔗 Repository: https://github.com/chengzju/CARAT

📚 More Papers: Explore a collection of exciting papers presented at AAAI 2024 and other conferences in the repositories:
- AAAI 2024 Papers: https://github.com/DmitryRyumin/AAAI-2024-Papers, @DmitryRyumin
- Other Conferences: DmitryRyumin/NewEraAI-Papers

🔍 Keywords: #EmotionRecognition #MultiModal #AI #Research #AAAI2024 #MachineLearning
posted an update 3 months ago
posted an update 3 months ago