
SmolPlatypus-1.5B-Sorted

This is a merge of pre-trained language models created using mergekit.

The ToastyPigeon/SmolLlama-1.5B-Sorted stack merge was trained on the Open-Platypus dataset using axolotl QLoRA for approximately 2 hours on 2x RTX 3060s.

This is a proof-of-concept model and should not be used for anything.

Merge Details

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge:

  • ToastyPigeon/SmolLlama-1.5B-Sorted + ToastyPigeon/SmolPlatypus-1.5B-Sorted-LoRA
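
Here the `+` denotes the ToastyPigeon/SmolPlatypus-1.5B-Sorted-LoRA adapter applied on top of the SmolLlama base before the passthrough. As a rough illustration of what the merge produces (a sketch using peft and transformers, not mergekit's internal code):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base stack merge in the same dtype the merge config uses.
base = AutoModelForCausalLM.from_pretrained(
    "ToastyPigeon/SmolLlama-1.5B-Sorted", torch_dtype=torch.float16
)

# Apply the QLoRA adapter and fold its deltas into the base weights,
# which is the effect of the base+LoRA passthrough.
merged = PeftModel.from_pretrained(
    base, "ToastyPigeon/SmolPlatypus-1.5B-Sorted-LoRA"
).merge_and_unload()

merged.save_pretrained("SmolPlatypus-1.5B-Sorted")
```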

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: ToastyPigeon/SmolLlama-1.5B-Sorted+ToastyPigeon/SmolPlatypus-1.5B-Sorted-LoRA
merge_method: passthrough
dtype: float16
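
This config can be run with mergekit's CLI (e.g. `mergekit-yaml config.yml ./output`), and the published model then loads like any other transformers checkpoint. A minimal usage sketch follows; the Alpaca-style prompt format is an assumption based on the Open-Platypus training data:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ToastyPigeon/SmolPlatypus-1.5B-Sorted")
model = AutoModelForCausalLM.from_pretrained(
    "ToastyPigeon/SmolPlatypus-1.5B-Sorted", torch_dtype=torch.float16
)

# Open-Platypus is largely Alpaca-formatted, so an instruction-style
# prompt is assumed here.
prompt = "### Instruction:\nName three prime numbers.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```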
