Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published 8 days ago • 24
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Paper • 2401.15687 • Published Jan 28 • 19
AppAgent: Multimodal Agents as Smartphone Users Paper • 2312.13771 • Published Dec 21, 2023 • 49