Edit model card

image/png

OpenHermes-2.5-neural-chat-v3-3-Slerp

This is the model for OpenHermes-2.5-neural-chat-v3-3-Slerp. I used mergekit to merge models.

Prompt Templates

You can use these prompt templates, but I recommend using ChatML.

ChatML (OpenHermes-2.5-Mistral-7B):

<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{user}<|im_end|>
<|im_start|>assistant
{asistant}<|im_end|>

neural-chat-7b-v3-3

### System:
{system}
### User:
{user}
### Assistant:

Yaml Config to reproduce

slices:
  - sources:
      - model: teknium/OpenHermes-2.5-Mistral-7B
        layer_range: [0, 32]
      - model: Intel/neural-chat-7b-v3-3
        layer_range: [0, 32]
merge_method: slerp
base_model: mistralai/Mistral-7B-v0.1
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5 # fallback for rest of tensors
dtype: bfloat16

Quantizationed versions

Quantizationed versions of this model is available thanks to TheBloke.

GPTQ
GGUF
AWQ

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 71.38
ARC (25-shot) 68.09
HellaSwag (10-shot) 86.2
MMLU (5-shot) 64.26
TruthfulQA (0-shot) 62.78
Winogrande (5-shot) 79.16
GSM8K (5-shot) 67.78

If you would like to support me:

☕ Buy Me a Coffee

Downloads last month
2,134
Safetensors
Model size
7.24B params
Tensor type
BF16
·

Merge of

Space using Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp 1

Collection including Weyaxi/OpenHermes-2.5-neural-chat-v3-3-Slerp

Evaluation results