Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
VLMEvalKit Evaluation Results Collection
More advanced and challenging multi-task evaluation
VLMEvalKit evaluation results on video understanding benchmarks
Compare Open LLM Leaderboard results