smol_bruin-7b / README.md
Azazelle's picture
Update README.md
26aa6ba
metadata
pipeline_tag: text-generation
tags:
  - mistral
  - merge
license: cc-by-4.0

Model Card for smol_bruin-7b

Slerp merge of go-bruins-v2 and smol-7b.

.yaml file for mergekit

slices:
  - sources:
      - model: rwitz/go-bruins-v2
        layer_range: [0, 32]
      - model: rishiraj/smol-7b
        layer_range: [0, 32]
merge_method: slerp
base_model: mistralai/Mistral-7B-v0.1
parameters:
  t:
    - filter: self_attn
      value: [0.44, 0.72, 0.61, 0.83, 1]
    - filter: mlp
      value: [0.56, 0.28, 0.39, 0.17, 0]
    - value: 0.5 # fallback for rest of tensors
dtype: float16