This is a frankenmerge of Mihaiii/Pallas-0.5 . It was done using mergekit.

It works well with long system prompts.

It isn't generic in a sense that it shouldn't be used for story telling, for example, but only for reasoning and text comprehension.

This model is trained on a private dataset.

Prompt Format:

SYSTEM: <ANY SYSTEM CONTEXT>
USER: 
ASSISTANT:

Merge config:

slices:
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [0, 60]
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [58, 60]
  - sources:
    - model: "Mihaiii/Pallas-0.5"
      layer_range: [55, 56]
merge_method: passthrough
dtype: bfloat16

Quants:

TheBloke/Pallas-0.5-frankenmerge-GGUF

TheBloke/Pallas-0.5-frankenmerge-GPTQ

TheBloke/Pallas-0.5-frankenmerge-AWQ

Downloads last month
28
Safetensors
Model size
36.1B params
Tensor type
BF16
·
Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for Mihaiii/Pallas-0.5-frankenmerge

Finetuned
Mihaiii/Pallas-0.5
Finetuned
(4)
this model
Quantizations
5 models