SmolLM-135M model in full precision (BF16), fine-tuned on TinyStories. Trained for 12k steps on 200k train stories with eval on the published validation split (~6.3 perplexity).

See ./generate_tinystories_fullprec.py for simple demo. This model is only intended for generating toy story examples and comparing quantization techniques.

Downloads last month
8
Safetensors
Model size
0.1B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Dominic/smollm135_fullprec_tinystories

Finetuned
(121)
this model