ar08
/

TinyLlama-1.1B-Chat-v1.0-GGUF-LAPTOP

Model card Files Files and versions Community

ar08 commited on May 4

Commit

6aab33a

•

1 Parent(s): b404c17

Create README.md

Files changed (1) hide show

README.md +36 -0

README.md ADDED Viewed

	@@ -0,0 +1,36 @@

+---
+license: mit
+language:
+- en
+---
+## Model Details
+- *Finetuned+Capable for laptop
+### Model Description
+------------
+Capable for run in Low-end **laptop**
+- **Developed by:** [Tiny-llama]("https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main")
+- **Finetuned from model [optional]:** [Tiny-llama]("https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main")
+## Uses
+```python
+from llama_cpp import Llama
+llm = Llama(
+      model_path="path/to/llama",
+      # n_gpu_layers=-1, # Uncomment to use GPU acceleration
+      # seed=1337, # Uncomment to set a specific seed
+      # n_ctx=2048, # Uncomment to increase the context window
+)
+output = llm(
+      "Q: Name the planets in the solar system? A: ", # Prompt
+      max_tokens=32, # Generate up to 32 tokens, set to None to generate up to the end of the context window
+      stop=["Q:", "\n"], # Stop generating just before the model would generate a new question
+      echo=True # Echo the prompt back in the output
+) # Generate a completion, can also call create_completion
+print(output)
+```