|
--- |
|
base_model: |
|
- aihub-app/zyte-1B |
|
- appvoid/palmer-002-2401 |
|
- vihangd/DopeyTinyLlama-1.1B-v1 |
|
tags: |
|
- mergekit |
|
- merge |
|
|
|
--- |
|
# palmer |
|
|
|
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). |
|
|
|
``` |
|
### Evaluation |
|
ARC-C OBQA HellaSwag PIQA Winogrande Average |
|
tinyllama | 0.3029 | 0.3600 | 0.5935 | 0.7329 | 0.5959 | 0.5170 | |
|
palmer-002-2402 | 0.3311 | 0.3600 | 0.5981 | 0.7416 | 0.6006 | 0.5266 | |
|
tinyllama-chat | 0.3285 | 0.3740 | 0.6037 | 0.7448 | 0.6022 | 0.5306 | |
|
zyte-1b | 0.3353 | 0.3700 | 0.6086 | 0.7541 | 0.5998 | 0.5335 | |
|
babbage-002 | 0.3285 | 0.3620 | 0.6380 | 0.7606 | 0.6085 | 0.5395 | |
|
palmer-003 | 0.3370 | 0.3740 | 0.6128 | 0.7486 | 0.6535 | 0.5451 | |
|
``` |
|
|
|
## Merge Details |
|
### Merge Method |
|
|
|
This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method. |
|
|
|
### Models Merged |
|
|
|
The following models were included in the merge: |
|
* [aihub-app/zyte-1B](https://huggingface.co/aihub-app/zyte-1B) |
|
* [appvoid/palmer-002-2401](https://huggingface.co/appvoid/palmer-002-2401) |
|
* [vihangd/DopeyTinyLlama-1.1B-v1](https://huggingface.co/vihangd/DopeyTinyLlama-1.1B-v1) |
|
|
|
### Configuration |
|
|
|
The following YAML configuration was used to produce this model: |
|
|
|
```yaml |
|
models: |
|
- model: aihub-app/zyte-1B |
|
parameters: |
|
weight: 1.0 |
|
- model: appvoid/palmer-002-2401 |
|
parameters: |
|
weight: 0.75 |
|
- model: vihangd/DopeyTinyLlama-1.1B-v1 |
|
parameters: |
|
weight: 0.75 |
|
merge_method: linear |
|
dtype: float16 |
|
``` |
|
|