YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published 3 days ago • 53
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published 9 days ago • 206
S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Paper • 2503.05085 • Published 7 days ago • 45
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 7 days ago • 72
Unified Reward Model for Multimodal Understanding and Generation Paper • 2503.05236 • Published 7 days ago • 104
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper • 2503.03983 • Published 8 days ago • 22
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 7 days ago • 60
Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection Paper • 2503.01449 • Published 11 days ago • 4
DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion Paper • 2503.01183 • Published 11 days ago • 26
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 11 days ago • 72