datatab
/

Yugo55A-GPT

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

datatab commited on Mar 5

Commit

e8aefdb

•

1 Parent(s): 77bdbe2

Update README.md

Files changed (1) hide show

README.md +20 -1

README.md CHANGED Viewed

@@ -10,6 +10,25 @@ tags:
 - merge
 ---
 # merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
@@ -27,7 +46,7 @@ The following models were included in the merge:
 * [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
 * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
-### Configuration
 The following YAML configuration was used to produce this model:

 - merge
 ---
+# # Yugo55A-GPT
+- **Developed by:** datatab
+- **License:** mit
+## 🏆 Results
+> Results obtained through the Serbian LLM evaluation, released by Aleksa Gordić: [serbian-llm-eval](https://github.com/gordicaleksa/serbian-llm-eval)
+> * Evaluation was conducted on a 4-bit version of the model due to hardware resource constraints.
+|    MODEL       | ARC-E | ARC-C | Hellaswag | BoolQ | Winogrande | OpenbookQA | PiQA  |
+|-----------|-------|-------|-----------|-------|------------|------------|-------|
+| [Yugo55-GPT-v4-4bit](https://huggingface.co/datatab/Yugo55-GPT-v4-4bit/) | **51.41**  | **36.00**  |   **57.51**  | **80.92** | **65.75**       | **34.70**        | **70.54**  |
+-
 # merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 * [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
 * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
+## 🧩 Configuration
 The following YAML configuration was used to produce this model: