Mistral-11B-OmniMix / README.md
Undi95's picture
Update README.md
0f1e5b7
|
raw
history blame
No virus
1.6 kB
metadata
license: cc-by-nc-4.0
slices:
  - sources:
      - model: Undi95/Mistral-11B-OpenOrcaPlatypus
        layer_range: [0, 48]
      - model: "/content/drive/MyDrive/Mistral-11B-CC-Zephyr"
        layer_range: [0, 48]
merge_method: slerp
base_model: Undi95/Mistral-11B-OpenOrcaPlatypus
parameters:
  t:
    - filter: lm_head 
      value: [0.75]
    - filter: embed_tokens
      value: [0.75]
    - filter: self_attn
      value: [0.75, 0.25]
    - filter: mlp
      value:  [0.25, 0.75]
    - filter: layernorm
      value: [0.5, 0.5]
    - filter: modelnorm
      value: [0.75]
    - value: 0.5 # fallback for rest of tensors
dtype: float16

hf-causal-experimental (pretrained=/content/drive/MyDrive/Mistral-11B-Test), limit: None, provide_description: False, num_fewshot: 0, batch_size: 4

Task Version Metric Value Stderr
arc_challenge 0 acc 0.5597 ± 0.0145
acc_norm 0.5819 ± 0.0144
arc_easy 0 acc 0.8308 ± 0.0077
acc_norm 0.8215 ± 0.0079
hellaswag 0 acc 0.6371 ± 0.0048
acc_norm 0.8213 ± 0.0038
piqa 0 acc 0.8134 ± 0.0091
acc_norm 0.8275 ± 0.0088
truthfulqa_mc 1 mc1 0.3990 ± 0.0171
mc2 0.5685 ± 0.0155
winogrande 0 acc 0.7474 ± 0.0122

image/png