Update README.md
Browse files
README.md
CHANGED
@@ -10,6 +10,25 @@ tags:
|
|
10 |
- merge
|
11 |
|
12 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
# merge
|
14 |
|
15 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
@@ -27,7 +46,7 @@ The following models were included in the merge:
|
|
27 |
* [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
|
28 |
* [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
|
29 |
|
30 |
-
|
31 |
|
32 |
The following YAML configuration was used to produce this model:
|
33 |
|
|
|
10 |
- merge
|
11 |
|
12 |
---
|
13 |
+
# # Yugo55A-GPT
|
14 |
+
|
15 |
+
- **Developed by:** datatab
|
16 |
+
- **License:** mit
|
17 |
+
|
18 |
+
|
19 |
+
## 🏆 Results
|
20 |
+
> Results obtained through the Serbian LLM evaluation, released by Aleksa Gordić: [serbian-llm-eval](https://github.com/gordicaleksa/serbian-llm-eval)
|
21 |
+
> * Evaluation was conducted on a 4-bit version of the model due to hardware resource constraints.
|
22 |
+
|
23 |
+
| MODEL | ARC-E | ARC-C | Hellaswag | BoolQ | Winogrande | OpenbookQA | PiQA |
|
24 |
+
|-----------|-------|-------|-----------|-------|------------|------------|-------|
|
25 |
+
| [Yugo55-GPT-v4-4bit](https://huggingface.co/datatab/Yugo55-GPT-v4-4bit/) | **51.41** | **36.00** | **57.51** | **80.92** | **65.75** | **34.70** | **70.54** |
|
26 |
+
|
27 |
+
|
28 |
+
|
29 |
+
|
30 |
+
|
31 |
+
-
|
32 |
# merge
|
33 |
|
34 |
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
|
|
46 |
* [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
|
47 |
* [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
|
48 |
|
49 |
+
## 🧩 Configuration
|
50 |
|
51 |
The following YAML configuration was used to produce this model:
|
52 |
|