Ranking of LLMs for agentic tasks
Submit and evaluate text-based models
Display Visual Document Retrieval leaderboard
Generate code from a description