---
title: Leaderboard
emoji: π
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown
---

This is the source code repository for the OpenGPT-X multilingual leaderboard.

The leaderboard aims to provide an overview of LLM performance across various languages.

The basic task set consists of MMLU, ARC, HellaSwag, GSM8K, TruthfulQA and Belebele.

To keep the results comparable to the Open LLM Leaderboard (https://huggingface.co/open-llm-leaderboard), the first five tasks are our internal machine translations of the English base tasks. They are complemented by Belebele, a high-quality multilingual benchmark by Meta.