MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization Paper • 2410.12957 • Published Oct 16 • 7
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration Paper • 2410.12183 • Published Oct 16 • 3
Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key Paper • 2410.10210 • Published Oct 14 • 3
MedMobile: A mobile-sized language model with expert-level clinical capabilities Paper • 2410.09019 • Published Oct 11 • 8