A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 16 days ago • 48
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 4 days ago • 122
Running • 2.41k • The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17, 2024 • 75
Harnessing Webpage UIs for Text-Rich Visual Understanding Paper • 2410.13824 • Published Oct 17, 2024 • 31
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents Paper • 2403.02502 • Published Mar 4, 2024 • 3
Running on CPU Upgrade • 12.9k • Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots