---
base_model:
  - mlabonne/AlphaMonarch-7B
  - datatab/Yugo55-GPT-v4
  - datatab/Yugo55-GPT-DPO-v1-chkp-300
  - NousResearch/Nous-Hermes-2-Mistral-7B-DPO
library_name: transformers
tags:
  - mergekit
  - merge
---

# Yugo55A-GPT

- **Developed by:** datatab
- **License:** MIT

## 🏆 Results

> Results obtained through the Serbian LLM evaluation, released by Aleksa Gordić: [serbian-llm-eval](https://github.com/gordicaleksa/serbian-llm-eval)
> * Evaluation was conducted on a 4-bit version of the model due to hardware resource constraints; a sketch for loading the model in 4-bit follows the results table.
| MODEL | ARC-E | ARC-C | Hellaswag | BoolQ | Winogrande | OpenbookQA | PiQA |
|---|---|---|---|---|---|---|---|
| Yugo55-GPT-v4-4bit | 51.41 | 36.00 | 57.51 | 80.92 | 65.75 | 34.70 | 70.54 |
| Yugo55A-GPT | 51.52 | 37.78 | 57.52 | 84.40 | 65.43 | 35.60 | 69.43 |
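The exact quantization settings used for the evaluation are not documented here. Below is a minimal sketch of loading the model in 4-bit with `transformers` and `bitsandbytes`; the NF4 quantization type, float16 compute dtype, and the `datatab/Yugo55A-GPT` model ID are assumptions for illustration, not confirmed evaluation settings.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# Illustrative 4-bit settings; the card does not state the exact
# quantization configuration used for the evaluation.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

# Model ID assumed from this card's repository name.
model = AutoModelForCausalLM.from_pretrained(
    "datatab/Yugo55A-GPT",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("datatab/Yugo55A-GPT")
```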
## 🔗 Merge Details

### Merge Method

> This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
> This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.

### Models Merged

The following models were included in the merge:

* [mlabonne/AlphaMonarch-7B](https://huggingface.co/mlabonne/AlphaMonarch-7B)
* [datatab/Yugo55-GPT-v4](https://huggingface.co/datatab/Yugo55-GPT-v4)
* [datatab/Yugo55-GPT-DPO-v1-chkp-300](https://huggingface.co/datatab/Yugo55-GPT-DPO-v1-chkp-300)
* [NousResearch/Nous-Hermes-2-Mistral-7B-DPO](https://huggingface.co/NousResearch/Nous-Hermes-2-Mistral-7B-DPO)

## 🧩 Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: datatab/Yugo55-GPT-v4
    parameters:
      weight: 1.0
  - model: datatab/Yugo55-GPT-DPO-v1-chkp-300
    parameters:
      weight: 1.0
  - model: mlabonne/AlphaMonarch-7B
    parameters:
      weight: 0.5
  - model: NousResearch/Nous-Hermes-2-Mistral-7B-DPO
    parameters:
      weight: 0.5
merge_method: linear
dtype: float16
```
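As a rough guide to reproducing the merge, the sketch below runs the configuration above through mergekit's Python entry points. The `MergeConfiguration` / `run_merge` calls and option names are assumptions based on mergekit's typical usage, not confirmed by this card; the CLI equivalent would be `mergekit-yaml config.yaml ./output-dir`.

```python
import torch
import yaml
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Load the YAML configuration shown above (saved locally as config.yaml).
with open("config.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

# Run the linear merge and write the result to a local directory
# (output path is illustrative).
run_merge(
    merge_config,
    out_path="./Yugo55A-GPT-merge",
    options=MergeOptions(
        cuda=torch.cuda.is_available(),
        copy_tokenizer=True,
        lazy_unpickle=False,
        low_cpu_memory=False,
    ),
)
```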