File size: 1,404 Bytes
6bd0f70 c576075 6bd0f70 c576075 cf6f801 712ee75 cf6f801 c576075 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 |
---
base_model:
- aihub-app/zyte-1B
- appvoid/palmer-002-2401
- vihangd/DopeyTinyLlama-1.1B-v1
tags:
- mergekit
- merge
---
# palmer
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
```
### Evaluation
ARC-C OBQA HellaSwag PIQA Winogrande Average
tinyllama | 0.3029 | 0.3600 | 0.5935 | 0.7329 | 0.5959 | 0.5170 |
palmer-002-2401 | 0.3311 | 0.3600 | 0.5981 | 0.7416 | 0.6006 | 0.5266 |
babbage-002 | 0.3285 | 0.3620 | 0.6380 | 0.7606 | 0.6085 | 0.5395 |
palmer-003 | 0.3370 | 0.3740 | 0.6128 | 0.7486 | 0.6535 | 0.5451 |
```
## Merge Details
### Merge Method
This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
### Models Merged
The following models were included in the merge:
* [aihub-app/zyte-1B](https://huggingface.co/aihub-app/zyte-1B)
* [appvoid/palmer-002-2401](https://huggingface.co/appvoid/palmer-002-2401)
* [vihangd/DopeyTinyLlama-1.1B-v1](https://huggingface.co/vihangd/DopeyTinyLlama-1.1B-v1)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
models:
- model: aihub-app/zyte-1B
parameters:
weight: 1.0
- model: appvoid/palmer-002-2401
parameters:
weight: 0.75
- model: vihangd/DopeyTinyLlama-1.1B-v1
parameters:
weight: 0.75
merge_method: linear
dtype: float16
```
|