cstr committed on
Commit
e1a3060
1 Parent(s): 2ce2af7

Update README.md

Files changed (1)
  1. README.md +10 -0
README.md CHANGED
@@ -51,6 +51,16 @@ From [Intel/low_bit_open_llm_leaderboard](https://huggingface.co/datasets/Intel/
  | Winogrande | 78.3 |
  | Average | 68.3 |
 
+ From [Occiglot Euro LLM Leaderboard](https://huggingface.co/spaces/occiglot/euro-llm-leaderboard)
+ | Model | 🇪🇺 Average ⬆️ | 🇩🇪 DE | 🇬🇧 EN | 🇬🇧ARC EN | 🇬🇧TruthfulQA EN | 🇬🇧Belebele EN | 🇬🇧HellaSwag EN | 🇬🇧MMLU EN | 🇩🇪ARC DE | 🇩🇪TruthfulQA DE | 🇩🇪Belebele DE | 🇩🇪HellaSwag DE | 🇩🇪MMLU DE |
+ |----------------------------------------------|----------------|--------|--------|-------------|------------------|----------------|----------------|------------|-------------|------------------|----------------|----------------|------------|
+ | mistral-community/Mixtral-8x22B-v0.1 | 68.3 | 66.81 | 72.87 | 70.56 | 52.29 | 93.89 | 70.41 | 77.17 | 63.9 | 29.31 | 92.44 | 77.9 | 70.49 |
+ | **cstr/Spaetzle-v85-7b** | 63.26 | 61.11 | 71.94 | 70.48 | 67.16 | 90.33 | 68.54 | 63.17 | 58.43 | 36.93 | 84.22 | 70.62 | 55.36 |
+ | cstr/Spaetzle-v60-7b | 63.32 | 60.95 | 71.65 | 69.88 | 66.24 | 90.11 | 68.43 | 63.59 | 58 | 37.31 | 84.22 | 70.09 | 55.11 |
+ | VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct| 64.49 | 60.07 | 74.71 | 74.49 | 66.19 | 91.67 | 74.55 | 66.65 | 59.37 | 29.57 | 88.56 | 66.43 | 56.44 |
+ | seedboxai/Llama-3-KafkaLM-8B-v0.1 | 62.27 | 59.67 | 69.75 | 69.03 | 58.14 | 90.78 | 64.35 | 66.43 | 57.66 | 30.33 | 85.89 | 66.88 | 57.58 |
+ | cstr/llama3-8b-spaetzle-v33 | 62.75 | 59.56 | 70.68 | 69.54 | 59.31 | 91.44 | 66.04 | 67.06 | 57.06 | 28.55 | 87.56 | 66.7 | 57.92 |
+
  ### AGIEval
  | Task |Version| Metric |Value| |Stderr|
  |------------------------------|------:|--------|----:|---|-----:|