Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
File size: 676 Bytes
2b62c4c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
---
title: Leaderboard
emoji: π
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown
---
This is the OpenGPT-X mutlilingual leaderboard source code repository.
The leaderboard aims to provied an overview of LLM performance over various languages.
The basic task set consists of MMLU, ARC, HellaSwag, GSM8k, TruthfulQA and belebele.
To make the results comparable to the Open LLM leaderboard (https://huggingface.co/open-llm-leaderboard) we selected the former five tasks based on our internal machine translations of the English base tasks, in addition to the high-quality multilingual benchmark belebele by Meta.
|