Model | DarijaMMLU | DarijaHellaSwag | Belebele Ary | Sentiment Analysis | DoDa-10k (Translation) | MArSum (Summarization) (LLM as a judge) |
|
BLEU | chrF | ||||||
jais-family-1p3b-chat | 35.39 | 32.51 | 38.33 | 45.29 | 00.13 | 06.18 | 00.50 |
jais-family-2p7b-chat | 37.44 | 34.49 | 44.11 | 51.56 | 00.25 | 07.46 | 00.90 |
gemma-2-2b-it | 28.58 | 32.42 | 25.22 | 53.36 | 00.10 | 04.96 | 06.80 |
Atlas-Chat-2B | 44.97 | 41.48 | 53.89 | 73.99 | 22.76 | 44.86 | 55.22 |
jais-family-6p7b-chat | 39.96 | 41.57 | 51.22 | 56.78 | 00.73 | 11.85 | 03.02 |
jais-adapted-7b-chat | 39.30 | 35.19 | 43.67 | 52.72 | 00.60 | 09.43 | 02.82 |
jais-family-13b-chat | 45.11 | 43.90 | 58.67 | 41.73 | 00.92 | 11.71 | 01.77 |
jais-adapted-13b-chat | 45.20 | 40.65 | 49.67 | 66.68 | 00.87 | 10.52 | 01.92 |
AceGPT-7b-chat | 35.98 | 36.57 | 30.11 | 40.23 | 00.44 | 11.33 | 02.28 |
AceGPT-13b-chat | 41.09 | 38.35 | 33.11 | 59.58 | 00.98 | 16.70 | 02.80 |
gemma-2-9b-it | 35.91 | 42.43 | 31.00 | 59.87 | 03.10 | 19.16 | 13.81 |
Llama-3.1-8B-Instruct | 44.13 | 38.24 | 47.00 | 44.08 | 00.92 | 14.19 | 01.28 |
Atlas-Chat-9B | 58.23 | 57.75 | 74.56 | 81.89 | 28.08 | 50.48 | 59.76 |