FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving • Paper • 2502.20238 • Published 11 days ago • 24
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind • Paper • 2502.15676 • Published 17 days ago • 3
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis • Paper • 2412.19723 • Published Dec 27, 2024 • 82
Neural Amortized Inference for Nested Multi-agent Reasoning • Paper • 2308.11071 • Published Aug 21, 2023 • 3
MMToM-QA: Multimodal Theory of Mind Question Answering • Paper • 2401.08743 • Published Jan 16, 2024 • 1
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond • Paper • 2403.14734 • Published Mar 21, 2024 • 21
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages • Paper • 2407.05975 • Published Jul 8, 2024 • 37