Llama 3 Experiments Collection
My merging experiments for the Llama-3 series of models.
This is a self-merge of NousResearch/Hermes-2-Pro-Llama-3-8B created with mergekit, for the purposes of experimentation.
Another failure; I need to read more about merging and look at more examples, and maybe experiment more with base models rather than SFT models.
This model was merged using the passthrough merge method.
The following model was included in the merge:
* NousResearch/Hermes-2-Pro-Llama-3-8B
The following YAML configuration was used to produce this model:
```yaml
slices:
  - sources:
      - model: NousResearch/Hermes-2-Pro-Llama-3-8B
        layer_range: [0, 32]
  - sources:
      - model: NousResearch/Hermes-2-Pro-Llama-3-8B
        layer_range: [0, 32]
merge_method: passthrough
dtype: bfloat16
tokenizer_source: union
```
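To make the passthrough behavior concrete: this method does not average or interpolate weights; it simply concatenates the selected layer slices into a deeper model. The sketch below is not mergekit's actual implementation, just a conceptual illustration (the `passthrough_merge` function and string layer names are hypothetical) of how the config above stacks the same 32 layers twice into a 64-layer model.

```python
def passthrough_merge(slices):
    """Conceptual sketch: concatenate layer slices without blending weights.

    slices: list of (layers, (start, end)) pairs, mirroring the YAML
    `slices` section above. Layers are represented here as strings
    instead of real weight tensors.
    """
    merged = []
    for layers, (start, end) in slices:
        # Passthrough copies each slice verbatim, in order.
        merged.extend(layers[start:end])
    return merged

# Stand-in for the 32 transformer layers of Hermes-2-Pro-Llama-3-8B.
base = [f"layer_{i}" for i in range(32)]

# Two slices of [0, 32] over the same model, as in the config above.
merged = passthrough_merge([(base, (0, 32)), (base, (0, 32))])
print(len(merged))  # 64 layers: the full stack duplicated back-to-back
```

In practice, the YAML config itself is run with mergekit's CLI (e.g. `mergekit-yaml config.yml ./output-model-directory`), which performs this stacking on the real weight tensors.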