broskicodes committed on
Commit
2a434e9
1 Parent(s): 2cdbbd8

Update README.md

Files changed (1)
  1. README.md +25 -1
README.md CHANGED
@@ -6,4 +6,28 @@ language:
  - en
  tags:
  - text-generation-inference
- ---
+ ---
+
+ # Simple Stories
+ Simple Stories will be a series of small text-generation models trained on the [TinyStories](https://huggingface.co/datasets/roneneldan/TinyStories) dataset.
+
+ The goal is to experiment with creating small language models that can perform highly specific tasks. In this case, the task is generating children's stories.
+
+ ## Model Details
+ The model has 4M parameters (Safetensors seems to have inflated this to 13M; I will look into why in the future). This model has not been fine-tuned for instructions. It will simply generate text when prompted. I will be working on an instruct model in the coming days.
+
+ The model is a decoder-only transformer with 4 decoder layers and 2 attention heads. It was trained on only ~50MB of text and can already produce semi-coherent stories.
+
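As a rough illustration of how an architecture like the one described above adds up to a parameter total, here is a minimal sketch of a GPT-style parameter count. Only the 4 layers and 2 heads come from this README. The embedding width, vocabulary size, context length, feed-forward multiplier, and weight tying are all assumptions chosen for illustration, not values taken from the training code, and `DecoderConfig` and `count_parameters` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class DecoderConfig:
    n_layers: int = 4             # from the README
    n_heads: int = 2              # from the README
    d_model: int = 256            # ASSUMPTION: embedding width
    vocab_size: int = 4096        # ASSUMPTION: tokenizer vocabulary size
    context_len: int = 256        # ASSUMPTION: maximum sequence length
    ff_mult: int = 4              # ASSUMPTION: MLP hidden size = ff_mult * d_model
    tied_embeddings: bool = True  # ASSUMPTION: output head shares the token embedding


def count_parameters(cfg: DecoderConfig) -> int:
    """Count weights and biases of a standard GPT-style decoder stack."""
    if cfg.d_model % cfg.n_heads != 0:
        raise ValueError("d_model must be divisible by n_heads")

    d = cfg.d_model
    d_ff = cfg.ff_mult * d

    # Token embedding table plus learned position embeddings.
    embeddings = cfg.vocab_size * d + cfg.context_len * d

    # Query, key, value, and output projections, each d x d with a bias.
    attention = 4 * (d * d + d)

    # Two linear layers (d -> d_ff -> d), each with a bias.
    mlp = (d * d_ff + d_ff) + (d_ff * d + d)

    # Two layer norms per block, each with a scale and a shift vector.
    layer_norms = 2 * (2 * d)

    per_layer = attention + mlp + layer_norms
    final_norm = 2 * d
    output_head = 0 if cfg.tied_embeddings else cfg.vocab_size * d

    return embeddings + cfg.n_layers * per_layer + final_norm + output_head


if __name__ == "__main__":
    total = count_parameters(DecoderConfig())
    print(f"{total:,} parameters ({total / 1e6:.2f}M)")
```

Under this formula the number of attention heads only changes how the projections are split, not how many parameters they contain, so the total is driven by the embedding width, the vocabulary size, and the layer count.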
+ The code used to train the model can be found on my [GitHub](https://github.com/broskicodes/slms). For now, running that code is also the only way to train and obtain the tokenizer needed for encoding and decoding text. Check it out if you are interested.
+
+ ## Sample
+ Here is a short sample generated by the model.
+
+ `Once upon a time, there was a little girl called Daisy. Daisy wanted to go to the park with her mommy. She packed some yummy food and chirpies and carried them . Daisy was so excited for her mommy to try. The puppy and Mommy brought a big spoon to make souping. Daisy loved swimming and jun ate until she was disappointed. They began to start playing in the garden. They gathered around and ate and boot into the bread . As Daisy got hungry on the grass, she found some magic. She read more to see what was Luckily, Daisy was very impressed. When the lady opened the pot, something tickling to another. It was a rare. Daisy was so happy that she gave the tunately. Daisy was no longer scared. She knew she had to tell Mommy at the store. She took her to the soup and opened the tasty hot chocolate. When Daisy gave it to Daisy and princessed around a special spoon every day.`
+
+ No, the story doesn't fully make sense. But most of the words are valid English, and the characters and overarching plot are consistent. This is progress :)
+
+ ## Going forward
+ The direct next step is creating an instruct model for interacting with and generating custom stories. After that, I will continue working to improve the base model by increasing the amount of data it is trained on and continuing to experiment with different hyperparameters.
+
+ If you have any suggestions or questions, or you want to discuss anything about the model, please reach out to me on Twitter [@_broskitweets](https://twitter.com/_broskitweets).