README / README.md
SondosMB's picture
Update README.md
eb75549 verified
|
raw
history blame
393 Bytes
metadata
title: README
emoji: 📈
colorFrom: gray
colorTo: indigo
sdk: static
pinned: false

Open-LLM-Leaderboard: Open-Style Question Evaluation

We introduce the Open-LLM-Leaderboard to track various LLMs’ performance on open-style questions and reflect their true capability. You can use OSQ-bench questions and prompts to evaluate your models automatically with an LLM-based evaluator.