metadata
title: Leaderboard
emoji: π
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown
This is the source code repository for the OpenGPT-X multilingual leaderboard. The leaderboard aims to provide an overview of LLM performance across various languages. The basic task set consists of MMLU, ARC, HellaSwag, GSM8k, TruthfulQA, and belebele. To make the results comparable to the Open LLM leaderboard (https://huggingface.co/open-llm-leaderboard), we selected the first five tasks, using our internal machine translations of the English base tasks, and added belebele, the high-quality multilingual benchmark by Meta.