Update README.md
Browse files
README.md
CHANGED
@@ -92,7 +92,7 @@ widget:
|
|
92 |
|
93 |
> 400MB of buzz: pure Python programming nectar! 🍯
|
94 |
|
95 |
-
This model is the general pre-trained checkpoint `BEE-spoke-data/smol_llama-101M-GQA` trained on a deduped version of `pypi` for +1 epoch.
|
96 |
|
97 |
- Its architecture is the same as the base, with some new Python-related tokens added to vocab prior to training.
|
98 |
- It can generate basic Python code and markdown in README style, but will struggle with harder planning/reasoning tasks
|
|
|
92 |
|
93 |
> 400MB of buzz: pure Python programming nectar! 🍯
|
94 |
|
95 |
+
This model is the general pre-trained checkpoint `BEE-spoke-data/smol_llama-101M-GQA` trained on a deduped version of `pypi` for +1 epoch. Play with the model in [this demo space](https://huggingface.co/spaces/BEE-spoke-data/beecoder-playground).
|
96 |
|
97 |
- Its architecture is the same as the base, with some new Python-related tokens added to vocab prior to training.
|
98 |
- It can generate basic Python code and markdown in README style, but will struggle with harder planning/reasoning tasks
|