---
license: mit
language:
- en
---
|
|
|
## Model Details

- Fine-tuned and capable of running on a laptop

### Model Description

Capable of running on a low-end **laptop**.
|
|
|
- **Developed by:** [TinyLlama](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main)
|
|
|
- **Finetuned from model:** [TinyLlama](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0/tree/main)
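Because the target is a low-end, CPU-only machine, the model is usually run from a quantized GGUF file. Below is a minimal download sketch with `huggingface_hub`; the `repo_id` and `filename` are placeholders for whichever GGUF conversion and quantization level you actually use, not artifacts published with this card.

```python
from huggingface_hub import hf_hub_download

# Placeholder repo and file: point these at a real GGUF conversion of the
# model and the quantization level (e.g. Q4_K_M) that fits your RAM.
model_path = hf_hub_download(
    repo_id="your-username/TinyLlama-1.1B-Chat-GGUF",
    filename="tinyllama-1.1b-chat.Q4_K_M.gguf",
)
print(model_path)  # Local path to pass to llama_cpp.Llama(model_path=...)
```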
|
|
|
|
|
|
|
## Uses
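Basic text completion with [llama-cpp-python](https://github.com/abetlen/llama-cpp-python):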
|
```python
from llama_cpp import Llama

llm = Llama(
    model_path="path/to/model.gguf",
    # n_gpu_layers=-1,  # Uncomment to use GPU acceleration
    # seed=1337,        # Uncomment to set a specific seed
    # n_ctx=2048,       # Uncomment to increase the context window
)
output = llm(
    "Q: Name the planets in the solar system? A: ",  # Prompt
    max_tokens=32,      # Generate up to 32 tokens; set to None to generate up to the end of the context window
    stop=["Q:", "\n"],  # Stop generating just before the model would generate a new question
    echo=True,          # Echo the prompt back in the output
)  # Generate a completion; can also call create_completion
print(output)
```
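Since the base model is chat-tuned, the chat-completion API may give better-formatted answers, as it applies the model's chat template for you. A minimal sketch, reusing the same placeholder path:

```python
from llama_cpp import Llama

llm = Llama(model_path="path/to/model.gguf")  # Same placeholder path as above

# create_chat_completion formats the messages with the model's chat template
response = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Name the planets in the solar system."},
    ],
    max_tokens=64,  # Cap the reply length
)
print(response["choices"][0]["message"]["content"])
```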
|