Submitted by WZDavid 50 Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs · 20 authors 75 2
Submitted by SivilTaram 19 SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories? · 8 authors 6 1
Submitted by vztu 17 MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding · 7 authors 1
Submitted by zhendongucb 12 DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering · 3 authors 29 1
Submitted by Franck-Dernoncourt 6 Lizard: An Efficient Linearization Framework for Large Language Models · 12 authors 1
Submitted by HenghuiDing 6 AnyI2V: Animating Any Conditional Image with Motion Control · 4 authors 86 1
Submitted by crainone 3 Replacing thinking with tool usage enables reasoning in small language models · 3 authors 1
Submitted by MatteoFasulo 2 AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles · 3 authors 2 1
Submitted by hongzhizhang 2 RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning · 7 authors 13 1
Submitted by Xa9aX - GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities · 12 authors 1
Submitted by Gray1y - MST-Distill: Mixture of Specialized Teachers for Cross-Modal Knowledge Distillation · 6 authors 11 1