LlamaMix-7B-slerp / README.md
MohamedAtta-AI's picture
Upload folder using huggingface_hub
0409a9e verified
|
raw
history blame contribute delete
No virus
845 Bytes
metadata
license: apache-2.0
tags:
  - merge
  - mergekit
  - lazymergekit
  - ise-uiuc/Magicoder-CL-7B
  - meta-llama/Llama-2-7b-chat-hf

LlamaMix-7B-slerp

LlamaMix-7B-slerp is a merge of the following models using mergekit:

🧩 Configuration

slices:
  - sources:
      - model: ise-uiuc/Magicoder-CL-7B
        layer_range: [0, 16]
      - model: meta-llama/Llama-2-7b-chat-hf
        layer_range: [0, 16]
merge_method: slerp
base_model: meta-llama/Llama-2-7b-chat-hf
parameters:
  t:
    - filter: self_attn
      value: [0, 0.2, 0.3, 0.5, 1]
    - filter: mlp
      value: [1, 0.1, 0.2, 0.3, 0]
    - value: 0.5
dtype: bfloat16