Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models • Paper • 2504.15271 • Published 3 days ago • 60
Describe Anything: Detailed Localized Image and Video Captioning • Paper • 2504.16072 • Published 2 days ago • 46
LLM as a Broken Telephone: Iterative Generation Distorts Information • Paper • 2502.20258 • Published Feb 27 • 27