Update README.md
README.md
CHANGED
@@ -33,6 +33,7 @@ inference: true
 <p><strong><a href="https://huggingface.co/spaces/afrizalha/Sasando-1" style="color: blue; font-family: Tahoma;">❕Go straight to the gradio demo❕</a></strong></p>
 <p><em style="color: black; font-weight: bold;">This repo contains the 25M version.</em></p>
 </center>
+
 ## 🎻 Welcome!
 Sasando-1 is a tiny, highly experimental Indonesian text generator built using the Phi-3 architecture. It comes with two variations of microscopic sizes: 7M and 25M parameters. It is trained on a tightly-controlled Indo4B dataset filtered to only have 18000 unique words. The method is inspired by Microsoft's TinyStories paper which demonstrates that a tiny language model can produce fluent text when trained on tightly-controlled dataset.
 
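The README says the training data was filtered down to 18000 unique words but does not describe how. The sketch below shows one plausible way to do that kind of vocabulary filtering: keep the most frequent words, then keep only sentences made up entirely of those words. The function names (`tokenize`, `build_vocab`, `filter_corpus`), the regex tokenizer, and the toy corpus are all assumptions for illustration, not the author's actual pipeline.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocab(sentences, max_words):
    """Return the set of the `max_words` most frequent words in the corpus."""
    counts = Counter(word for s in sentences for word in tokenize(s))
    return {word for word, _ in counts.most_common(max_words)}


def filter_corpus(sentences, vocab):
    """Keep only non-empty sentences whose every word is in the allowed vocabulary."""
    kept = []
    for s in sentences:
        tokens = tokenize(s)
        if tokens and all(t in vocab for t in tokens):
            kept.append(s)
    return kept


# Toy corpus (hypothetical); a real run would use max_words=18000 on the full dataset.
corpus = [
    "Saya suka makan nasi",
    "Saya suka minum teh",
    "Ibu suka makan nasi",
    "Ibu minum teh",
    "Fotosintesis membutuhkan klorofil",
]
vocab = build_vocab(corpus, max_words=7)
kept = filter_corpus(corpus, vocab)
print(kept)
```

In this toy run the last sentence is dropped because its words fall outside the seven most frequent ones. Dropping whole sentences, rather than masking rare words, keeps the remaining text grammatical, which matches the TinyStories idea of training on a small but clean vocabulary.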