Distil Gemma 2 2b

This is a Gemma 2 2B model distilled from google/gemma-2-9b-it and fine-tuned on The Tome dataset.


Prompt Template

ChatML

<|im_start|>system
{system}<|im_end|>
<|im_start|>user
{user}<|im_end|>
<|im_start|>assistant
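
The template above can be filled in programmatically. The following is a minimal sketch, not the author's own tooling: the helper name `build_chatml_prompt` is hypothetical, and the trailing newline after the final `assistant` tag is an assumption based on common ChatML usage.

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Fill the ChatML template and leave the assistant turn open for generation."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        # Assumption: a newline follows the assistant tag, as is usual for ChatML.
        "<|im_start|>assistant\n"
    )


# Example: build a prompt string to pass to the tokenizer.
prompt = build_chatml_prompt(
    "You are a helpful assistant.",
    "Explain model distillation in one sentence.",
)
print(prompt)
```

If the tokenizer in the repo ships a chat template, using that is likely preferable to hand-built strings, since it keeps the special tokens in sync with the model.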

Training Information

This model was trained on 8x NVIDIA H100 NVL GPUs for the equivalent of 120 GPU hours.

  • Loss Achieved: 0.27
  • Epochs: 3

Checkpoints are available in the repo for continued training.

Evals

IN PROGRESS

Model size: 3.2B params (Safetensors)
Tensor type: BF16

Model tree for macadeliccc/distil-gemma-2-2b

Base model: google/gemma-2-2b