A newer version of this model is available: MihaiPopa-1/CinnabarLM-4M-Base

CinnabarLM

CinnabarLM is a tiny, 4M-parameter LLM trained for ~33 minutes on a T4 GPU (on Colab)! It's only 16 MB in size!

Why?

Because it's a good idea to make tiny LLMs. Some people already did with MicroLM, Spark 4 5M and Tenete 8M, but not myself!

Not Instruction-Tuned: It's only a base model, so it only completes text.
English-Only: It's trained on English data (FineWeb), it's NOT multilingual.
Not a Standard Model: It's NOT a Qwen/Llama/GPT model. Standard Transformers can't recognize this!
Preview: This is a preview version, it generates gibberish often. CinnabarLM 1 will solve this with Llama.

It's trained on 80 million tokens of FineWeb (CC-MAIN-2025-26 snapshot), and the knowledge cutoff is June 2025.
The name "CinnabarLM" that I picked was made by combining "Cinnabar" (the new block from the Chaos Cubed drop in Minecraft) + "LM" (Language Model)