metadata
title: Swahili Llm Leaderboard
emoji: 🔥
colorFrom: gray
colorTo: green
sdk: streamlit
sdk_version: 1.34.0
app_file: app.py
pinned: false
license: apache-2.0

🏆 Swahili LLM-Leaderboard

A joint community effort to create a central leaderboard for Swahili LLMs. These models are currently evaluated on the Open LLM Leaderboard. Contributions and corrections are welcome!
We call a model "open" if it can be deployed locally and used commercially.

Interactive Dashboard

https://swahili-llm-leaderboard.streamlit.app/

Leaderboard

| Model Name | Publisher | Open? | Base Model | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
|---|---|---|---|---|---|---|---|---|---|---|
| Mollel/Swahili Gemma | Mollel | Yes | Gemma | 61.32 | 58.96 | 76.4 | 61.02 | 52.1 | 75.61 | 43.82 |
| LeroyDyer/Mixtral_AI_SwahiliTron_7b | leroy Samuel Dyer | Yes | Mixtral | 61.14 | 57.08 | 81.59 | 58.49 | 60.66 | 75.53 | 33.51 |
| Mollel/Swahili-Alpaca-Llama-3-8b_16bit | Mollel | Yes | Llama-3 | 60.97 | 57.25 | 78.03 | 63.97 | 50.13 | 76.87 | 39.58 |
| mwitiderrick/SwahiliInstruct-v0.1 | Derrick | Yes | Mistral | 58.92 | 57.59 | 80.92 | 57 | 58.08 | 74.66 | 25.25 |
| mwitiderrick/SwahiliInstruct-v0.2 | Derrick | Yes | Mistral | 54.25 | 55.2 | 78.22 | 50.3 | 57.08 | 73.24 | 11.45 |
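
The Average values in the table above match the arithmetic mean of the six benchmark scores (ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8K), rounded to two decimals. The README does not state this formula explicitly, so treat the following as a minimal sketch for checking an entry; the helper name `average` is hypothetical and not part of the app's code.

```python
# Scores copied from two leaderboard rows, in column order:
# ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8K
scores = {
    "Mollel/Swahili Gemma": [58.96, 76.4, 61.02, 52.1, 75.61, 43.82],
    "mwitiderrick/SwahiliInstruct-v0.1": [57.59, 80.92, 57, 58.08, 74.66, 25.25],
}


def average(benchmark_scores):
    """Arithmetic mean of the benchmark scores, rounded to two decimals.

    Assumption: the leaderboard's Average column is computed this way;
    it reproduces the values listed in the table.
    """
    return round(sum(benchmark_scores) / len(benchmark_scores), 2)


averages = {model: average(values) for model, values in scores.items()}
print(averages)
```

Running a new row through the same check before submitting it helps catch transcription errors in individual scores.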

How to Contribute

Contributions are always welcome! You can contribute in the following ways:

  • table work (don't forget the links):
    • filling in missing entries
    • adding a new model as a new row to the leaderboard. Please keep the descending order.
    • adding a new benchmark as a new column in the leaderboard, and adding the benchmark to the benchmarks table. Please keep the descending order.
  • code work:
    • improving the existing code
    • requesting and implementing new features
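
For the table work described above, one way to keep the descending order when adding a model is to re-sort the rows after inserting the new one. This is a sketch under the assumption that "descending order" refers to the Average column, which is how the current table is ordered. The function `add_row` and the entry `example-org/example-model` are hypothetical and for illustration only.

```python
def add_row(rows, new_row):
    """Return a new list containing new_row, sorted by Average (highest first)."""
    return sorted(rows + [new_row], key=lambda row: row["Average"], reverse=True)


# A few existing rows, reduced to the two fields needed for ordering.
leaderboard = [
    {"Model Name": "Mollel/Swahili Gemma", "Average": 61.32},
    {"Model Name": "LeroyDyer/Mixtral_AI_SwahiliTron_7b", "Average": 61.14},
    {"Model Name": "mwitiderrick/SwahiliInstruct-v0.2", "Average": 54.25},
]

# Hypothetical entry: not a real model or a real result.
new_entry = {"Model Name": "example-org/example-model", "Average": 60.0}

updated = add_row(leaderboard, new_entry)
names = [row["Model Name"] for row in updated]
print(names)
```

Because `add_row` builds a new list, the original `leaderboard` is left unchanged, which makes it easy to compare the table before and after the addition.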

Sponsorship

The benchmarks are currently English-based, and we need support to translate them into Swahili. We welcome sponsorships for this translation effort, which would make the benchmarks accessible to a broader audience. We are grateful for the dedication of our collaborators and hope to extend their impact with the support of sponsors committed to advancing language technologies.