|
--- |
|
license: mit |
|
datasets: |
|
- HuggingFaceFW/fineweb |
|
pipeline_tag: text-generation |
|
--- |
|
# Tiny-LLM |
|
|
|
A Tiny LLM model with just 10 Million parameters, this is probably one of the small LLM arounds, and it is functional. |
|
|
|
## Pretraining |
|
|
|
Tiny-LLM was trained on 32B tokens of the Fineweb dataset, with a context length of 1024 tokens. |
|
|
|
## Getting Started |
|
|
|
To start using these models, you can simply load them via the Hugging Face `transformers` library: |
|
|
|
```python |
|
import torch |
|
from transformers import AutoModelForCausalLM, AutoTokenizer |
|
|
|
|
|
MODEL_NAME = "arnir0/Tiny-LLM" |
|
|
|
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME) |
|
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME) |
|
|
|
def generate_text(prompt, model, tokenizer, max_length=512, temperature=1, top_k=50, top_p=0.95): |
|
inputs = tokenizer.encode(prompt, return_tensors="pt") |
|
|
|
outputs = model.generate( |
|
inputs, |
|
max_length=max_length, |
|
temperature=temperature, |
|
top_k=top_k, |
|
top_p=top_p, |
|
do_sample=True |
|
) |
|
|
|
|
|
generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True) |
|
return generated_text |
|
|
|
def main(): |
|
# Define your prompt |
|
prompt = "According to all known laws of aviation, there is no way a bee should be able to fly." |
|
|
|
generated_text = generate_text(prompt, model, tokenizer) |
|
|
|
print(generated_text) |
|
|
|
if __name__ == "__main__": |
|
main() |
|
``` |