appvoid/palmer-002.5

	---
	base_model:
	- aihub-app/zyte-1B
	- appvoid/palmer-002-2401
	- vihangd/DopeyTinyLlama-1.1B-v1
	tags:
	- mergekit
	- merge

	---
	# palmer

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

### Evaluation

| Model | ARC-C | OBQA | HellaSwag | PIQA | Winogrande | Average |
| --- | --- | --- | --- | --- | --- | --- |
| tinyllama | 0.3029 | 0.3600 | 0.5935 | 0.7329 | 0.5959 | 0.5170 |
| palmer-002-2402 | 0.3311 | 0.3600 | 0.5981 | 0.7416 | 0.6006 | 0.5266 |
| tinyllama-chat | 0.3285 | 0.3740 | 0.6037 | 0.7448 | 0.6022 | 0.5306 |
| zyte-1b | 0.3353 | 0.3700 | 0.6086 | 0.7541 | 0.5998 | 0.5335 |
| babbage-002 | 0.3285 | 0.3620 | 0.6380 | 0.7606 | 0.6085 | 0.5395 |
| palmer-003 | 0.3370 | 0.3740 | 0.6128 | 0.7486 | 0.6535 | 0.5451 |
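
The Average column appears to be the unweighted mean of the five benchmark scores. That is an inference from the numbers, not something the card states. A minimal Python sketch to recompute it, with the scores copied from the evaluation table (the names `scores` and `diffs` are only for illustration):

```python
# Benchmark order: ARC-C, OBQA, HellaSwag, PIQA, Winogrande.
# Each entry maps a model to (benchmark scores, reported Average).
scores = {
    "tinyllama": ([0.3029, 0.3600, 0.5935, 0.7329, 0.5959], 0.5170),
    "palmer-002-2402": ([0.3311, 0.3600, 0.5981, 0.7416, 0.6006], 0.5266),
    "tinyllama-chat": ([0.3285, 0.3740, 0.6037, 0.7448, 0.6022], 0.5306),
    "zyte-1b": ([0.3353, 0.3700, 0.6086, 0.7541, 0.5998], 0.5335),
    "babbage-002": ([0.3285, 0.3620, 0.6380, 0.7606, 0.6085], 0.5395),
    "palmer-003": ([0.3370, 0.3740, 0.6128, 0.7486, 0.6535], 0.5451),
}

# Assumption: Average = unweighted mean of the five benchmark scores.
diffs = {}
for model, (benchmarks, reported) in scores.items():
    mean = sum(benchmarks) / len(benchmarks)
    diffs[model] = abs(mean - reported)
    print(f"{model}: computed {mean:.4f}, reported {reported:.4f}")
```

Under that assumption, the recomputed means agree with the reported values to within 0.0005 for every row.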

	## Merge Details
	### Merge Method

	This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
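
As a rough illustration of what a linear merge does, the sketch below takes a weighted average of same-named parameters across checkpoints. It uses plain Python lists in place of real tensors. `linear_merge` and the toy state dicts are hypothetical names, not mergekit's API, and dividing by the sum of the weights is an assumption about how the weights are applied.

```python
def linear_merge(state_dicts, weights, normalize=True):
    """Weighted average of parameters that share a name and shape.

    Illustrative only: real checkpoints hold tensors, not Python lists.
    """
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per model")
    # Assumption: weights are rescaled to sum to 1 before averaging.
    total = sum(weights) if normalize else 1.0
    merged = {}
    for name in state_dicts[0]:
        params = [sd[name] for sd in state_dicts]
        merged[name] = [
            sum(w * p[i] for w, p in zip(weights, params)) / total
            for i in range(len(params[0]))
        ]
    return merged


# Toy "checkpoints", each with a single flat parameter vector.
model_a = {"layer.weight": [1.0, 2.0]}
model_b = {"layer.weight": [3.0, 6.0]}

merged = linear_merge([model_a, model_b], weights=[1.0, 1.0])
print(merged)
```

With equal weights this reduces to a plain element-wise mean of the two parameter vectors.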

	### Models Merged

	The following models were included in the merge:
	* [aihub-app/zyte-1B](https://huggingface.co/aihub-app/zyte-1B)
	* [appvoid/palmer-002-2401](https://huggingface.co/appvoid/palmer-002-2401)
	* [vihangd/DopeyTinyLlama-1.1B-v1](https://huggingface.co/vihangd/DopeyTinyLlama-1.1B-v1)

	### Configuration

	The following YAML configuration was used to produce this model:

```yaml
models:
  - model: aihub-app/zyte-1B
    parameters:
      weight: 1.0
  - model: appvoid/palmer-002-2401
    parameters:
      weight: 0.75
  - model: vihangd/DopeyTinyLlama-1.1B-v1
    parameters:
      weight: 0.75
merge_method: linear
dtype: float16
```
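
The three weights in this configuration sum to 2.5 rather than 1. Assuming the merge rescales them to sum to 1 (understood to be the default for mergekit's linear method, but treated here as an assumption), the short Python sketch below shows each model's effective share. The names `config_weights` and `shares` are hypothetical.

```python
# Weights copied from the YAML configuration.
config_weights = {
    "aihub-app/zyte-1B": 1.0,
    "appvoid/palmer-002-2401": 0.75,
    "vihangd/DopeyTinyLlama-1.1B-v1": 0.75,
}

# Assumption: weights are normalized so that they sum to 1.
total = sum(config_weights.values())
shares = {name: weight / total for name, weight in config_weights.items()}

for name, share in shares.items():
    print(f"{name}: {share:.2f}")
```

Under that assumption, zyte-1B contributes 40% of each merged parameter and the other two models 30% each.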