Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 3 days ago • 42
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 10 days ago • 97
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 89 items • Updated 23 days ago • 93
Running on CPU Upgrade 62 62 LeaderboardExplorer 🔎 Filter and display leaderboards based on selected criteria
Running on CPU Upgrade 55 55 Open PL LLM Leaderboard 🏆 Display and filter a leaderboard of language models
Running on CPU Upgrade 84 84 Open LLM Leaderboard Model Comparator 🏆 Compare Open LLM Leaderboard results