Update README.md
README.md
CHANGED
@@ -23,15 +23,15 @@ Heidrun-Mistral-7B-chat is a chat-model based on [Heidrun-Mistral-7B-base](https
 
 It is a new SOTA Danish open-source LLM and shows very strong performance in logic and reasoning tasks.
 
+Heidrun-7B ranks 1st among Danish open-source LLMs on the [ScandEval](https://scandeval.com/mainland-scandinavian-nlg/) benchmark and shares 1st place.
+
 # Benchmarks
 
 The following benchmarks were run with [ScandEval](https://github.com/ScandEval/ScandEval). Rankings exclude merged models; GPT-4 and GPT-3.5 rank 1st and 2nd:
 
 - **MMLU-da**: 35.66% ± 0.85% / 51.68% ± 0.63%, ranks 3rd
-- **DANSK**: 50.
-- **Hellaswag-da**: 29.18
-
-Further evaluations will be tested.
+- **DANSK**: 50.80% ± 2.33% / 34.04% ± 1.76%, ranks 3rd
+- **Hellaswag-da**: 29.18% ± 0.99% / 46.64% ± 0.76%, ranks 4th
 
 # Datasets
 This model is trained on the Danish instruction datasets [danish-OpenHermes](https://huggingface.co/datasets/Mabeck/danish-OpenHermes) and [skoleGPT](https://huggingface.co/datasets/kobprof/skolegpt-instruct), which have not been safeguarded or aligned.
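Since the card describes a chat model, a minimal inference sketch may be useful alongside the update above. It is not taken from the model card: the repo id `Mabeck/Heidrun-Mistral-7B-chat` and the presence of a tokenizer chat template are assumptions; if no chat template is defined, format the prompt as the model card specifies instead.

```python
# Hypothetical usage sketch: repo id and chat template are assumed, not
# confirmed by the card above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Mabeck/Heidrun-Mistral-7B-chat"  # assumed Hugging Face repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

# Danish prompt: "What is the capital of Denmark?"
messages = [{"role": "user", "content": "Hvad er Danmarks hovedstad?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```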
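The benchmark figures in the diff come from ScandEval. A rough reproduction sketch follows; it assumes ScandEval's documented Python `Benchmarker` entry point, and constructor/call arguments (for example, how Danish datasets are selected) differ between ScandEval versions, so check the documentation of the installed release.

```python
# Rough reproduction sketch for the ScandEval scores above. The Benchmarker
# entry point is taken from ScandEval's documentation; the exact options for
# restricting the run to the Danish datasets vary between versions.
from scandeval import Benchmarker

benchmark = Benchmarker()

# Benchmarks the (assumed) repo id on ScandEval's default dataset suite.
benchmark("Mabeck/Heidrun-Mistral-7B-chat")
```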
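For inspecting the training data named in the Datasets section, a short sketch with the `datasets` library: the `kobprof/skolegpt-instruct` id comes from the link in the card, while the `Mabeck/danish-OpenHermes` id and the `train` split names are assumptions.

```python
# Sketch for loading the two instruction datasets named above; the "train"
# split names are assumed.
from datasets import load_dataset

skolegpt = load_dataset("kobprof/skolegpt-instruct", split="train")
danish_hermes = load_dataset("Mabeck/danish-OpenHermes", split="train")

print(skolegpt[0])         # inspect one skoleGPT instruction example
print(len(danish_hermes))  # number of danish-OpenHermes examples
```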