# sean_test_merge_out
This is a merge of pre-trained language models created using mergekit.
## Merge Details
### Merge Method
This model was merged using the linear [DARE](https://arxiv.org/abs/2311.03099) merge method (`dare_linear`), with mllm-dev/gpt2_f_experiment_0 as the base model.
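DARE sparsifies each fine-tuned model's parameter delta before combining: a random fraction of each delta's entries is dropped and the survivors are rescaled so the expected update is preserved, after which the deltas are linearly combined and added back to the base weights. Below is a minimal sketch of that idea over plain state dicts (illustrative only, not mergekit's implementation; `drop_rate` is an assumed value, since the configuration further down does not set a density):

```python
import torch

def dare_linear_merge(base_sd, model_sds, weights, drop_rate=0.9, normalize=True):
    """Illustrative DARE-linear merge over state dicts (not mergekit's code)."""
    if normalize:
        # Rescale weights to sum to 1, mirroring the `normalize` parameter.
        total = sum(weights)
        weights = [w / total for w in weights]
    merged = {}
    for name, base_param in base_sd.items():
        merged_param = base_param.clone().float()
        for sd, w in zip(model_sds, weights):
            delta = sd[name].float() - base_param.float()
            # Drop: randomly zero a fraction of the delta entries.
            mask = torch.rand_like(delta) >= drop_rate
            # Rescale: blow up the survivors so the expected delta is unchanged.
            delta = delta * mask / (1.0 - drop_rate)
            merged_param += w * delta
        merged[name] = merged_param.to(base_param.dtype)
    return merged
```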
### Models Merged
The following models were included in the merge:
- mllm-dev/gpt2_f_experiment_7
- mllm-dev/gpt2_f_experiment_6
- mllm-dev/gpt2_f_experiment_9
- mllm-dev/gpt2_f_experiment_2
- mllm-dev/gpt2_f_experiment_5
- mllm-dev/gpt2_f_experiment_8
- mllm-dev/gpt2_f_experiment_1
- mllm-dev/gpt2_f_experiment_4
- mllm-dev/gpt2_f_experiment_3
### Configuration
The following YAML configuration was used to produce this model. All ten checkpoints are merged over the full layer range [0, 12] with an equal weight of 0.1, and `normalize: 1.0` rescales the weights to sum to 1:
```yaml
base_model:
  model:
    path: mllm-dev/gpt2_f_experiment_0
dtype: float16
merge_method: dare_linear
parameters:
  normalize: 1.0
slices:
- sources:
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_0
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_1
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_2
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_3
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_4
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_5
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_6
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_7
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_8
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_9
    parameters:
      weight: 0.1
```
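To reproduce the merge, the configuration above can be saved to a file and passed to mergekit's `mergekit-yaml` entry point (e.g. `mergekit-yaml config.yml ./sean_test_merge_out`). Once published, the result loads like any GPT-2 checkpoint; a minimal sketch, assuming the repository id `mllm-dev/gpt2_m_experiment_dare_linear` shown on this page:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repository id assumed from this model card's page metadata.
repo_id = "mllm-dev/gpt2_m_experiment_dare_linear"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

inputs = tokenizer("Hello, world", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```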