---
license: mit
---

A model trained using lilLM: https://github.com/CohleM/lilLM

The model has ~300M parameters and was trained on 9.5B tokens from OpenWebText: https://huggingface.co/datasets/Skylion007/openwebtext
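
For a rough sense of scale, the two figures above can be combined into a tokens-per-parameter ratio. This is only a back-of-the-envelope sketch: both inputs are the approximate numbers quoted in this card, and the variable names are illustrative rather than anything taken from lilLM.

```python
# Approximate figures quoted in this card (assumed, not exact counts).
n_params = 300e6   # ~300M parameters
n_tokens = 9.5e9   # 9.5B training tokens from OpenWebText

# Training tokens seen per model parameter.
tokens_per_param = n_tokens / n_params
print(f"{tokens_per_param:.1f} tokens per parameter")  # prints "31.7 tokens per parameter"
```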