AngelSlayer-12B-Unslop-Mell-RPMax-DARKNESS-v3
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged with the linear DELLA merge method (della_linear in mergekit), with TheDrummer/UnslopNemo-12B-v4 as the base model.
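As a rough intuition for what a linear DELLA merge does, the sketch below prunes each model's delta from the base with a magnitude-dependent keep probability, rescales the survivors, and adds a weighted sum of the pruned deltas back onto the base. It is a simplified illustration in plain Python, not mergekit's implementation: the function names della_prune and merge_linear are hypothetical, the linear mapping from magnitude rank to keep probability is an assumption, and the normalization here simply divides by the sum of the weights.

```python
import random


def della_prune(delta, density, epsilon, rng):
    """Randomly drop entries of a delta vector, favoring large magnitudes.

    Assumption for this sketch: the keep probability rises linearly with
    magnitude rank from (density - epsilon) to (density + epsilon).
    """
    n = len(delta)
    # Rank 0 is the smallest |delta|, rank n - 1 the largest.
    order = sorted(range(n), key=lambda i: abs(delta[i]))
    pruned = [0.0] * n
    for rank, i in enumerate(order):
        frac = rank / (n - 1) if n > 1 else 0.5
        keep_p = density - epsilon + 2 * epsilon * frac
        if rng.random() < keep_p:
            # Rescale survivors so the expected value matches the original.
            pruned[i] = delta[i] / keep_p
    return pruned


def merge_linear(base, deltas, weights, lam=1.0):
    """Add the weight-normalized sum of deltas, scaled by lam, to the base."""
    total = sum(weights)
    return [
        b + lam * sum(w * d[i] for w, d in zip(weights, deltas)) / total
        for i, b in enumerate(base)
    ]


rng = random.Random(0)
base = [0.0, 0.0, 0.0, 0.0]
delta_a = della_prune([0.1, -0.4, 0.05, 0.8], density=0.65, epsilon=0.05, rng=rng)
delta_b = della_prune([0.3, 0.2, -0.6, 0.01], density=0.75, epsilon=0.05, rng=rng)
merged = merge_linear(base, [delta_a, delta_b], weights=[0.25, 0.3], lam=1.0)
```

In this reading, density controls the average fraction of each delta that survives, epsilon controls how strongly magnitude influences survival, and lambda scales the combined delta before it is added to the base.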
Models Merged
The following models were included in the merge:
- DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
- ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.2
- inflatebot/MN-12B-Mag-Mell-R1
Configuration
The following YAML configuration was used to produce this model:
models:
  - model: ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.2
    parameters:
      weight:
        - filter: self_attn
          value: 0.35
        - filter: mlp
          value: 0.2
        - value: 0.25
      density: 0.65
  - model: TheDrummer/UnslopNemo-12B-v4
    parameters:
      weight:
        - filter: self_attn
          value: 0.3
        - filter: mlp
          value: 0.15
        - value: 0.2
      density: 0.6
  - model: inflatebot/MN-12B-Mag-Mell-R1
    parameters:
      weight:
        - filter: self_attn
          value: 0.2
        - filter: mlp
          value: 0.35
        - value: 0.3
      density: 0.75
  - model: DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS
    parameters:
      weight:
        - filter: mlp
          value: 0.35
        - value: 0.25
      density: 0.7
base_model: TheDrummer/UnslopNemo-12B-v4
merge_method: della_linear
dtype: bfloat16
chat_template: "chatml"
tokenizer_source: union
parameters:
  normalize: true
  int8_mask: true
  epsilon: 0.05
  lambda: 1
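To make the per-tensor weights in the configuration above easier to read, here is a small sketch that resolves which weight each model contributes to a given tensor. It assumes that a filter matches when its string occurs in the tensor name and that an entry without a filter acts as the fallback. The helper resolve_weight and the example tensor names are illustrative, not part of mergekit.

```python
# Weight rules copied from the configuration above; None marks the fallback.
WEIGHT_RULES = {
    "ArliAI/Mistral-Nemo-12B-ArliAI-RPMax-v1.2": [
        ("self_attn", 0.35), ("mlp", 0.2), (None, 0.25)],
    "TheDrummer/UnslopNemo-12B-v4": [
        ("self_attn", 0.3), ("mlp", 0.15), (None, 0.2)],
    "inflatebot/MN-12B-Mag-Mell-R1": [
        ("self_attn", 0.2), ("mlp", 0.35), (None, 0.3)],
    "DavidAU/MN-GRAND-Gutenberg-Lyra4-Lyra-12B-DARKNESS": [
        ("mlp", 0.35), (None, 0.25)],
}


def resolve_weight(rules, tensor_name):
    """Return the first rule value whose filter occurs in the tensor name."""
    for pattern, value in rules:
        if pattern is None or pattern in tensor_name:
            return value
    raise KeyError(f"no rule matches {tensor_name}")


def weights_for(tensor_name):
    """Map each model to the raw weight it contributes for this tensor."""
    return {m: resolve_weight(r, tensor_name) for m, r in WEIGHT_RULES.items()}


# Example tensor names in the usual Hugging Face layout (illustrative only).
for name in (
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.0.mlp.down_proj.weight",
    "model.embed_tokens.weight",
):
    per_model = weights_for(name)
    print(name, round(sum(per_model.values()), 2))
```

The raw weights for attention and MLP tensors do not sum to exactly 1, which is where normalize: true comes in. Note that the DARKNESS entry has no self_attn rule, so its attention tensors fall back to 0.25.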