Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published Jun 4 • 27
Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance Paper • 2401.15687 • Published Jan 28 • 20
AppAgent: Multimodal Agents as Smartphone Users Paper • 2312.13771 • Published Dec 21, 2023 • 49