LH-Tech-AI
/

Pin

+---
+datasets:
+- starhopp3r/TinyChat
+language:
+- en
+pipeline_tag: text-generation
+tags:
+- tiny
+- chat
+- small
+- cpu
+- open
+- open-source
+- gpt2
+- gpt
+---
+# Welcome to 🤏 Pin Model Series
+The models in the LH-Tech AI Pin Series are very small models that were trained on starhopp3r/TinyChat.
+## Models overview
+| Model | Parameters | Training iters | Final Train Loss | Quality | Example Chat |
+|:----|:----|:----|:----|:----|:----|
+| Pin-5M  | 5.37M  | 1000 | 3.170788 | Poor | *User:* What is the weather like today? *Pin:* Yes, a bright day is shining and makes everything have a good day a lot. |
+| Pin-10M | 10.06M | 1500 | 2.562048 | Poor | *User:* What is the weather like today? *Pin:* That sounds nice, I agree, it is nice to talk about new ideas. |
+| Pin-15M | 14.84M | 1500 | 2.XXXXXX |  | *User:* What is the weather like today? *Pin:* XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX |
+| Pin-20M | XX.XXM | 1500 | 2.XXXXXX |  | *User:* What is the weather like today? *Pin:* XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX |
+| Pin-25M | XX.XXM | 1500 | 2.XXXXXX |  | *User:* What is the weather like today? *Pin:* XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX |
+## Training
+### Training data
+We trained on [starhopp3r/TinyChat](https://huggingface.co/datasets/starhopp3r/TinyChat) and used the gpt-2 tokenizer.
+### Training code/scripts
+You can find the full training code for the Pin Model Series in this repo.<br>
+**Tip:** If you want to train one of these models yourself, make sure to adjust the model config like this:
+| Model | n\_layer | n\_head | n\_embd | n\_inner |
+| :---- | :---- | :---- | :---- | :---- |
+| **Pin-5M** | 4 | 8 | 96 | 384 |
+| **Pin-10M** | 6 | 8 | 160 | 640 |
+| **Pin-15M** | *8* | *8* | *208* | *832* |
+| **Pin-20M** | *10* | *8* | *256* | *1024* |
+| **Pin-25M** | *12* | *12* | *288* | *1152* |
+Have fun :D
+### Training details
+We trained all these models in under 25 minutes on a single T4 GPU in a Kaggle Session.<br>
+So you are able to easily recreate all of the Pin model without having to launch a 8xH100 cluster 😂
+## How to use the model
+You can easily use the favorite model of the Pin series like this:
+1. Download `use.py` from this repo.
+2. Adjust the subfolder argument here:
+   ```python
+   answer = run_pin_inference(user_query, model_id="LH-Tech-AI/Pin", subfolder="Pin-5M") # use your favorite model here, e.g. "Pin-25M" or "Pin-15M"...
+   ```
+3. Adjust the input prompt here:
+   ```python
+   user_query = "What is the weather like today?" # insert your prompt here
+   ```
+4. Launch it with Torch installed in Python3.
+5. Have fun :D
+## Acknowledgements...
+...to:
+- HF Transformers
+- Kaggle for the T4 GPU
+- starhopp3r for his TinyChat dataset