Text Generation
Transformers
Safetensors
Serbian
mistral
mergekit
Merge
text-generation-inference
conversational
Inference Endpoints
datatab commited on
Commit
e8aefdb
1 Parent(s): 77bdbe2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -10,6 +10,25 @@ tags:
10
  - merge
11
 
12
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  # merge
14
 
15
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
@@ -27,7 +46,7 @@ The following models were included in the merge:
27
  * [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
28
  * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
29
 
30
- ### Configuration
31
 
32
  The following YAML configuration was used to produce this model:
33
 
 
10
  - merge
11
 
12
  ---
13
+ # # Yugo55A-GPT
14
+
15
+ - **Developed by:** datatab
16
+ - **License:** mit
17
+
18
+
19
+ ## 🏆 Results
20
+ > Results obtained through the Serbian LLM evaluation, released by Aleksa Gordić: [serbian-llm-eval](https://github.com/gordicaleksa/serbian-llm-eval)
21
+ > * Evaluation was conducted on a 4-bit version of the model due to hardware resource constraints.
22
+
23
+ | MODEL | ARC-E | ARC-C | Hellaswag | BoolQ | Winogrande | OpenbookQA | PiQA |
24
+ |-----------|-------|-------|-----------|-------|------------|------------|-------|
25
+ | [Yugo55-GPT-v4-4bit](https://huggingface.co/datatab/Yugo55-GPT-v4-4bit/) | **51.41** | **36.00** | **57.51** | **80.92** | **65.75** | **34.70** | **70.54** |
26
+
27
+
28
+
29
+
30
+
31
+ -
32
  # merge
33
 
34
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
46
  * [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
47
  * [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)
48
 
49
+ ## 🧩 Configuration
50
 
51
  The following YAML configuration was used to produce this model:
52