chlee10
/

T3Q-MSlerp-13B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

T3Q-MSlerp-13B / README.md

chlee10's picture

Update README.md

332cfb7 verified 8 months ago

|

history blame contribute delete

879 Bytes

	---
	license: apache-2.0
	---

	## T3Q-MSlerp-13B

	T3Q-MSlerp-13B is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
	* [zhengr/MixTAO-7Bx2-MoE-Instruct-v7.0](https://huggingface.co/zhengr/MixTAO-7Bx2-MoE-Instruct-v7.0)
	* [yunconglong/13B_MATH_DPO](https://huggingface.co/yunconglong/13B_MATH_DPO)


	Model Developers Chihoon Lee(chlee10), T3Q


	```yaml

	slices:
	- sources:
	- model: zhengr/MixTAO-7Bx2-MoE-Instruct-v7.0
	layer_range: [0, 32]
	- model: yunconglong/13B_MATH_DPO
	layer_range: [0, 32]

	merge_method: slerp
	base_model: zhengr/MixTAO-7Bx2-MoE-Instruct-v7.0

	parameters:
	t:
	- filter: self_attn
	value: [0, 0.5, 0.3, 0.7, 1]
	- filter: mlp
	value: [1, 0.5, 0.7, 0.3, 0]
	- value: 0.5 # fallback for rest of tensors

	dtype: float16

	```