add usage
README.md CHANGED
@@ -32,7 +32,22 @@ We would like to acknowledge the open-source frameworks [llm.c](https://github.c
The model can be used in two primary ways:
1. **With Hugging Face’s Transformers Library**

```python
from transformers import pipeline
import torch

path = "tinyllm/124M-0.0"
prompt = "The sea is blue but it's his red sea"

# Text-generation pipeline in bfloat16 with automatic device placement
generator = pipeline(
    "text-generation", model=path, max_new_tokens=30, repetition_penalty=1.3,
    model_kwargs={"torch_dtype": torch.bfloat16}, device_map="auto",
)
print(generator(prompt)[0]['generated_text'])
```

2. **With llama.cpp**

Generate a GGUF model file using this [tool](https://github.com/ggerganov/llama.cpp/blob/master/convert_hf_to_gguf.py) and use the generated GGUF file for inference (see the sketch after the conversion command below).

```bash
python3 convert_hf_to_gguf.py models/mymodel/
```
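
Once the GGUF file exists, it can be run with llama.cpp's command-line tool. A minimal sketch, assuming a recent llama.cpp build with the `llama-cli` binary on your PATH and a hypothetical output file name `models/mymodel/ggml-model-f16.gguf` (the conversion script prints the actual name it writes):

```bash
# Hypothetical GGUF file name: check what convert_hf_to_gguf.py actually wrote
llama-cli -m models/mymodel/ggml-model-f16.gguf \
  -p "The sea is blue but it's his red sea" \
  -n 30 --repeat-penalty 1.3
```

The `-n 30` and `--repeat-penalty 1.3` flags mirror the `max_new_tokens` and `repetition_penalty` settings used in the Transformers example above.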
## Disclaimer