GPT-2 ChatML FP32 (SFT on no_robots)

This is a fine-tuned GPT-2 model (124M parameters) trained on the human-curated SFT dataset HuggingFaceH4/no_robots using ChatML conversational formatting.

Model Details

  • Base Model: gpt2
  • Dataset: HuggingFaceH4/no_robots
  • Conversational Format: ChatML (<|im_start|> / <|im_end|>)
  • Training Epochs: 2 epochs
  • Eval Perplexity: 14.46

For GGUF quantized formats (including IQ4_NL and IQ3_XXS), please visit the GGUF repository: JustACluelessKid2/gpt2-chatml-fp32-GGUF.

Downloads last month
222
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JustACluelessKid2/gpt2-chatml-fp32

Finetuned
(2185)
this model
Quantizations
1 model

Dataset used to train JustACluelessKid2/gpt2-chatml-fp32