The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 3 days ago • 104
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published 3 days ago • 124
Expect the Unexpected: FailSafe Long Context QA for Finance Paper • 2502.06329 • Published 6 days ago • 118
Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models Paper • 2502.04404 • Published 10 days ago • 18
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published 9 days ago • 40
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 17 days ago • 81
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text Paper • 2501.15654 • Published 21 days ago • 11
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 18 days ago • 13
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity Paper • 2501.16295 • Published 20 days ago • 8
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 20 days ago • 343
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper • 2501.13826 • Published 24 days ago • 24
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 25 days ago • 83
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 24 days ago • 68