title: README
emoji: 📈
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false
# Open-LLM-Leaderboard: Open-Style Question Evaluation
We introduce the Open-LLM-Leaderboard to track various LLMs’ performance on open-style questions and reflect their true capability.
You can use OSQ-bench questions and prompts to evaluate your models automatically with an LLM-based evaluator.
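
As a rough illustration of what automatic evaluation with an LLM-based evaluator can look like, here is a minimal sketch. It is not the official OSQ-bench pipeline: the prompt template, the `question`/`reference` field names, and the `model` and `judge` callables are all assumptions made for this example, and the real OSQ-bench prompts may be worded differently.

```python
from typing import Callable, Dict, Iterable

# Hypothetical grading prompt; the actual OSQ-bench prompt may differ.
JUDGE_TEMPLATE = (
    "Question: {question}\n"
    "Reference answer: {reference}\n"
    "Model answer: {answer}\n"
    "Reply with exactly one word: Correct or Incorrect."
)


def is_correct(verdict: str) -> bool:
    """Parse the judge's reply; only replies starting with 'correct' count."""
    return verdict.strip().lower().startswith("correct")


def evaluate(
    examples: Iterable[Dict[str, str]],
    model: Callable[[str], str],
    judge: Callable[[str], str],
) -> float:
    """Return the fraction of open-style questions the judge marks correct.

    `model` maps a question to a free-form answer, and `judge` maps a
    grading prompt to a verdict string. Both are placeholders for
    whatever LLM clients you actually use.
    """
    examples = list(examples)
    if not examples:
        return 0.0
    hits = 0
    for ex in examples:
        answer = model(ex["question"])
        prompt = JUDGE_TEMPLATE.format(
            question=ex["question"],
            reference=ex["reference"],
            answer=answer,
        )
        hits += is_correct(judge(prompt))
    return hits / len(examples)
```

To use it, wrap your model and your evaluator LLM in functions that take a string and return a string, then call `evaluate(examples, model, judge)` on a list of question/reference pairs.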