--- base_model: - Gryphe/Pantheon-RP-1.0-8b-Llama-3 - jondurbin/bagel-8b-v1.0 library_name: transformers tags: - mergekit - merge --- # merged This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the task_swapping merge method using [jondurbin/bagel-8b-v1.0](https://huggingface.co/jondurbin/bagel-8b-v1.0) as a base. ### Models Merged The following models were included in the merge: * [Gryphe/Pantheon-RP-1.0-8b-Llama-3](https://huggingface.co/Gryphe/Pantheon-RP-1.0-8b-Llama-3) ### Configuration The following YAML configuration was used to produce this model: ```yaml base_model: jondurbin/bagel-8b-v1.0 dtype: bfloat16 merge_method: task_swapping slices: - sources: - layer_range: [0, 32] model: Gryphe/Pantheon-RP-1.0-8b-Llama-3 parameters: diagonal_offset: 2.0 weight: 0.9 - layer_range: [0, 32] model: jondurbin/bagel-8b-v1.0 ```