SEAP: Training-free Sparse Expert Activation Pruning Unlock the Brainpower of Large Language Models Paper • 2503.07605 • Published 4 days ago • 63
PDE-Controller: LLMs for Autoformalization and Reasoning of PDEs Paper • 2502.00963 • Published Feb 3 • 16
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding Paper • 2502.08946 • Published 30 days ago • 184
Revealing the Barriers of Language Agents in Planning Paper • 2410.12409 • Published Oct 16, 2024 • 27
Exploring Model Kinship for Merging Large Language Models Paper • 2410.12613 • Published Oct 16, 2024 • 21
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception Paper • 2410.12628 • Published Oct 16, 2024 • 35
HumanEval-V: Benchmarking High-Level Visual Reasoning with Complex Diagrams in Coding Tasks Paper • 2410.12381 • Published Oct 16, 2024 • 44
Towards Natural Image Matting in the Wild via Real-Scenario Prior Paper • 2410.06593 • Published Oct 9, 2024 • 3
Empirical Study of Mutual Reinforcement Effect and Application in Few-shot Text Classification Tasks via Prompt Paper • 2410.09745 • Published Oct 13, 2024 • 3
GS^3: Efficient Relighting with Triple Gaussian Splatting Paper • 2410.11419 • Published Oct 15, 2024 • 12
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning Paper • 2410.09754 • Published Oct 13, 2024 • 8
Towards Synergistic, Generalized, and Efficient Dual-System for Robotic Manipulation Paper • 2410.08001 • Published Oct 10, 2024 • 4
EchoPrime: A Multi-Video View-Informed Vision-Language Model for Comprehensive Echocardiography Interpretation Paper • 2410.09704 • Published Oct 13, 2024 • 13
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15, 2024 • 13
SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI Paper • 2410.11096 • Published Oct 14, 2024 • 13
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models Paper • 2410.11710 • Published Oct 15, 2024 • 20
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices Paper • 2410.11795 • Published Oct 15, 2024 • 18
What Matters in Transformers? Not All Attention is Needed Paper • 2406.15786 • Published Jun 22, 2024 • 31
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation Paper • 2410.11779 • Published Oct 15, 2024 • 26