Edit model card

sean_test_merge_out

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the linear DARE merge method using mllm-dev/gpt2_f_experiment_0 as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model:
  model:
    path: mllm-dev/gpt2_f_experiment_0
dtype: float16
merge_method: dare_linear
parameters:
  normalize: 1.0
slices:
- sources:
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_0
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_1
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_2
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_3
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_4
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_5
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_6
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_7
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_8
    parameters:
      weight: 0.1
  - layer_range: [0, 12]
    model:
      model:
        path: mllm-dev/gpt2_f_experiment_9
    parameters:
      weight: 0.1
Downloads last month
8
Safetensors
Model size
124M params
Tensor type
FP16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for mllm-dev/gpt2_m_experiment_dare_linear