The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Paper • 2503.04606 • Published 3 days ago • 7
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published 4 days ago • 18
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 6 days ago • 65
Societal Alignment Frameworks Can Improve LLM Alignment Paper • 2503.00069 • Published 10 days ago • 16
Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding Paper • 2503.01842 • Published 6 days ago • 1
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published 7 days ago • 55
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published 17 days ago • 91
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 19 days ago • 65
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published 19 days ago • 36
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 21 days ago • 52
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 25 days ago • 182
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 25 days ago • 143
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published 26 days ago • 50
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published 26 days ago • 46
Retrieval-augmented Large Language Models for Financial Time Series Forecasting Paper • 2502.05878 • Published 28 days ago • 39
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published 27 days ago • 47
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published about 1 month ago • 122