-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper โข 2402.17485 โข Published โข 193 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper โข 2312.01841 โข Published โข 1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper โข 2311.16498 โข Published โข 1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper โข 2312.02134 โข Published โข 2
Collections
Discover the best community collections!
Collections including paper arxiv:2404.10667
-
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 19 -
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
Paper โข 2409.01876 โข Published โข 2 -
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper โข 2312.13578 โข Published โข 29 -
Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
Paper โข 2312.03029 โข Published โข 26
-
sdasd112132/Vision-8B-MiniCPM-2_5-Uncensored-and-Detailed-4bit
Visual Question Answering โข Updated โข 219 โข 30 -
100
Idefics3
๐Generate text based on an image and prompt
-
37
Vilt Vqa
๐Ask questions about images and get answers
-
vikhyatk/moondream2
Image-Text-to-Text โข Updated โข 164k โข 1.08k
-
Rho-1: Not All Tokens Are What You Need
Paper โข 2404.07965 โข Published โข 93 -
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
Paper โข 2404.10667 โข Published โข 19 -
Instruction-tuned Language Models are Better Knowledge Learners
Paper โข 2402.12847 โข Published โข 26 -
DoRA: Weight-Decomposed Low-Rank Adaptation
Paper โข 2402.09353 โข Published โข 28
-
Can Large Language Models Understand Context?
Paper โข 2402.00858 โข Published โข 23 -
OLMo: Accelerating the Science of Language Models
Paper โข 2402.00838 โข Published โข 84 -
Self-Rewarding Language Models
Paper โข 2401.10020 โข Published โข 147 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper โข 2401.17072 โข Published โข 25