medbotlm-v0.2 / README.md
ayan-sh003's picture
Update README.md
4373116 verified
---
base_model:
- ruslanmv/Medical-Llama3-8B
- HPAI-BSC/Llama3-Aloe-8B-Alpha
library_name: transformers
tags:
- mergekit
- merge
license: llama3
---
# llama3-medbotlm-v0.3
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [HPAI-BSC/Llama3-Aloe-8B-Alpha](https://huggingface.co/HPAI-BSC/Llama3-Aloe-8B-Alpha) as a base.
### Models Merged
The following models were included in the merge:
* [ruslanmv/Medical-Llama3-8B](https://huggingface.co/ruslanmv/Medical-Llama3-8B)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: ruslanmv/Medical-Llama3-8B
parameters:
weight: 0.50
- model: HPAI-BSC/Llama3-Aloe-8B-Alpha
parameters:
weight: 0.50
base_model: HPAI-BSC/Llama3-Aloe-8B-Alpha
merge_method: task_arithmetic
dtype: bfloat16
```