roneneldan/TinyStories
Viewer β’ Updated β’ 2.14M β’ 85.3k β’ 1.02k
eeny is a tiny language model that writes short children's stories β and runs in a browser tab, offline, with no GPU. At 999,328 total parameters it is, to our knowledge, among the smallest coherent story models, and it beats TinyStories-1M (a model ~3Γ larger by non-embedding params) on a fair, tokenizer-independent metric.
| metric | eeny | TinyStories-1M |
|---|---|---|
| total params | 999,328 | ~3M (1M non-embedding) |
| bits-per-byte (held-out TinyStories val, lower=better) | 0.625 | 0.707 |
| in-browser speed (WASM, single thread, no GPU) | 417 tok/s (2.4 ms/tok) | β |
| file size (int8 KNM) | 1.76 MB | β |
Bits-per-byte is tokenizer-independent (normalize cross-entropy by UTF-8 bytes), so the comparison is fair across different tokenizers.
Once upon a time, there was a little β girl named Lily. She loved to play outside in the sun and
watch the birds fly high up in the sky. One day, she saw an old man sitting ...
The cat sat on the β tree and said, "I'm sorry I was so sad. Can you forgive me?"
The cat smiled and gave the cat a big hug. From that day on, the cat always shared ...
| file | what |
|---|---|
eeny_int8.knm |
int8 weights, KNM1 v3 (1.76 MB) β for the Sprapp WASM engine |
eeny_final.pt |
full-precision PyTorch checkpoint ({cfg, model}) |
tokenizer.json |
4096-vocab BPE tokenizer |
Part of the Sprapp project β offline on-device tiny LMs in the browser (eeny / meeny / miny family). Trained on TinyStories (Eldan & Li, 2023).