title: Leaderboard
emoji: 👁
colorFrom: blue
colorTo: blue
sdk: gradio
sdk_version: 4.19.2
app_file: app.py
pinned: false
license: unknown

This is the source code repository for the OpenGPT-X multilingual leaderboard. The leaderboard aims to provide an overview of LLM performance across various languages. The basic task set consists of MMLU, ARC, HellaSwag, GSM8k, TruthfulQA and belebele. To make the results comparable to the Open LLM leaderboard (https://huggingface.co/open-llm-leaderboard), we selected the first five tasks, using our internal machine translations of the English base tasks, in addition to belebele, the high-quality multilingual benchmark by Meta.