Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement • Paper • 2412.04003 • Published 17 days ago
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait • Paper • 2412.01064 • Published 20 days ago
Scaling Transformers for Low-Bitrate High-Quality Speech Coding • Paper • 2411.19842 • Published 22 days ago
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs • Paper • 2411.19146 • Published 24 days ago
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model • Paper • 2411.19108 • Published 24 days ago
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers • Paper • 2411.18673 • Published 24 days ago