ar08's picture
Create README.md
6aab33a verified
|
raw
history blame
1.08 kB
---
license: mit
language:
- en
---
## Model Details
- *Finetuned+Capable for laptop
### Model Description
------------
Capable for run in Low-end **laptop**
- **Developed by:** [Tiny-llama]("https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main")
- **Finetuned from model [optional]:** [Tiny-llama]("https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main")
## Uses
```python
from llama_cpp import Llama
llm = Llama(
model_path="path/to/llama",
# n_gpu_layers=-1, # Uncomment to use GPU acceleration
# seed=1337, # Uncomment to set a specific seed
# n_ctx=2048, # Uncomment to increase the context window
)
output = llm(
"Q: Name the planets in the solar system? A: ", # Prompt
max_tokens=32, # Generate up to 32 tokens, set to None to generate up to the end of the context window
stop=["Q:", "\n"], # Stop generating just before the model would generate a new question
echo=True # Echo the prompt back in the output
) # Generate a completion, can also call create_completion
print(output)
```