cstr/Spaetzle-v69-7b

cstr committed on Apr 17

Commit cb0fc74 • 1 Parent(s): cf0fab9

Update README.md

Files changed (1)

README.md +15 -0

@@ -40,6 +40,21 @@ It achieves (running quantized) in
 - German EQ Bench: Score (v2_de): 62.59 (Parseable: 171.0).
 - English EQ Bench: Score (v2): 76.43 (Parseable: 171.0).
+[Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard):
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cstr__Spaetzle-v69-7b)
+ |             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |72.87|
+|AI2 Reasoning Challenge (25-Shot)|69.54|
+|HellaSwag (10-Shot)              |86.77|
+|MMLU (5-Shot)                    |64.63|
+|TruthfulQA (0-shot)              |65.61|
+|Winogrande (5-shot)              |81.93|
+|GSM8k (5-shot)                   |68.76|
+Nous benchmark results:
 |                            Model                             |AGIEval|GPT4All|TruthfulQA|Bigbench|Average|
 |--------------------------------------------------------------|------:|------:|---------:|-------:|------:|
 |[Spaetzle-v69-7b](https://huggingface.co/cstr/Spaetzle-v69-7b)|  44.48|  75.84|     66.15|   46.59|  58.27|
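
The `Avg.` and `Average` values in the two tables above can be cross-checked against the per-benchmark scores. The following is a minimal sketch, assuming both averages are plain arithmetic means rounded to two decimal places; the half-up rounding mode and the helper name `mean_2dp` are assumptions for illustration, not something the README or the leaderboard states.

```python
from decimal import Decimal, ROUND_HALF_UP


def mean_2dp(scores):
    """Arithmetic mean of decimal score strings, rounded half-up to 2 places.

    Hypothetical helper: the rounding mode is an assumption, chosen so that
    exact ties such as 58.265 are not affected by binary float error.
    """
    values = [Decimal(s) for s in scores]
    mean = sum(values) / len(values)
    return mean.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


# Open LLM Leaderboard scores from the diff:
# ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8k
open_llm = ["69.54", "86.77", "64.63", "65.61", "81.93", "68.76"]

# Nous benchmark scores from the diff:
# AGIEval, GPT4All, TruthfulQA, Bigbench
nous = ["44.48", "75.84", "66.15", "46.59"]

open_llm_avg = mean_2dp(open_llm)
nous_avg = mean_2dp(nous)

print("Open LLM Leaderboard average:", open_llm_avg)
print("Nous benchmark average:", nous_avg)
```

`Decimal` is used instead of `float` because the Nous mean falls exactly on a rounding boundary, where float-based `round()` could go either way.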