Generative Evaluation of Complex Reasoning in Large Language Models Paper • 2504.02810 • Published 7 days ago • 10
CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation Paper • 2504.00043 • Published 11 days ago • 7
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 2 days ago • 55
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 2 days ago • 107
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Feb 13 • 84
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 10 days ago • 222
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published 14 days ago • 72
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 21 days ago • 67
Frac-Connections: Fractional Extension of Hyper-Connections Paper • 2503.14125 • Published 23 days ago • 19
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 23 days ago • 116
Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? Paper • 2503.11207 • Published 27 days ago • 5
New Trends for Modern Machine Translation with Large Reasoning Models Paper • 2503.10351 • Published 28 days ago • 22
LLM as a Broken Telephone: Iterative Generation Distorts Information Paper • 2502.20258 • Published Feb 27 • 26
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper • 2503.00865 • Published Mar 2 • 62
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement Paper • 2502.16776 • Published Feb 24 • 6