README.md · sourabhdattawad/meta-llama-3-8b-instruct-gguf at main

meta-llama-3-8b-instruct-gguf / README.md

sourabhdattawad

Update README.md

9b3a5e5 verified about 2 months ago

preview code

raw history blame contribute delete

No virus

2.4 kB

	---
	language:
	- en
	pipeline_tag: text-generation
	tags:
	- meta
	- pytorch
	- llama
	- llama-3
	- llama-cpp
	- quantized
	- 8-bit
	- GGUF
	- 8 Billion
	- python
	- instruct
	- google-colab
	inference: false
	model_creator: sourabhdattawad
	model_name: meta-llama-3-8B-instruct-gguf
	quantized_by: sourabhdattawad
	license_name: llama3
	---

	## Usage

	Package installation

	```
	pip install llama-cpp-python "huggingface_hub[cli]"
	```

	Download the model:

	```
	huggingface-cli download sourabhdattawad/meta-llama-3-8b-instruct-gguf meta-llama-3-8b-instruct.Q8_0.gguf --local-dir . --local-dir-use-symlinks False
	```

	```Python
	from llama_cpp import Llama
	llm = Llama(
	model_path="meta-llama-3-8b-instruct.Q8_0.gguf",
	# n_gpu_layers=-1, # Uncomment to use GPU acceleration
	# seed=1337, # Uncomment to set a specific seed
	# n_ctx=2048, # Uncomment to increase the context window
	)
	output = llm(
	"Q: Name the planets in the solar system? A: ", # Prompt
	max_tokens=50, # Generate up to 50 tokens, set to None to generate up to the end of the context window
	stop=["Q:", "\n"], # Stop generating just before the model would generate a new question
	echo=True # Echo the prompt back in the output
	)
	output
	```
	```
	Llama.generate: prefix-match hit

	llama_print_timings: load time = 7770.49 ms
	llama_print_timings: sample time = 100.16 ms / 40 runs ( 2.50 ms per token, 399.35 tokens per second)
	llama_print_timings: prompt eval time = 0.00 ms / 1 tokens ( 0.00 ms per token, inf tokens per second)
	llama_print_timings: eval time = 35214.73 ms / 40 runs ( 880.37 ms per token, 1.14 tokens per second)
	llama_print_timings: total time = 35895.91 ms / 41 tokens
	{'id': 'cmpl-01e2feb3-c0ff-4a6e-8ca4-b8bf2172da01',
	'object': 'text_completion',
	'created': 1713912080,
	'model': 'meta-llama-3-8b-instruct.Q8_0.gguf',
	'choices': [{'text': 'Q: Name the planets in the solar system? A: 1. Mercury, 2. Venus, 3. Earth, 4. Mars, 5. Jupiter, 6. Saturn, 7. Uranus, 8. Neptune.',
	'index': 0,
	'logprobs': None,
	'finish_reason': 'stop'}],
	'usage': {'prompt_tokens': 13, 'completion_tokens': 40, 'total_tokens': 53}}
	```

	## Google Colab

	[https://colab.research.google.com/drive/1vhrCKGzY7KP5mScHNUl7hjmbPsUyj_sj?usp=sharing)](https://colab.research.google.com/drive/1vhrCKGzY7KP5mScHNUl7hjmbPsUyj_sj?usp=sharing)