Quantization made by Richard Erkhov.

Llama3-1B-Base - GGUF

Model creator: https://huggingface.co/andrijdavid/
Original model: https://huggingface.co/andrijdavid/Llama3-1B-Base/

Name	Quant method	Size
Llama3-1B-Base.Q2_K.gguf	Q2_K	0.79GB
Llama3-1B-Base.IQ3_XS.gguf	IQ3_XS	0.87GB
Llama3-1B-Base.IQ3_S.gguf	IQ3_S	0.88GB
Llama3-1B-Base.Q3_K_S.gguf	Q3_K_S	0.88GB
Llama3-1B-Base.IQ3_M.gguf	IQ3_M	0.89GB
Llama3-1B-Base.Q3_K.gguf	Q3_K	0.91GB
Llama3-1B-Base.Q3_K_M.gguf	Q3_K_M	0.91GB
Llama3-1B-Base.Q3_K_L.gguf	Q3_K_L	0.94GB
Llama3-1B-Base.IQ4_XS.gguf	IQ4_XS	0.99GB
Llama3-1B-Base.Q4_0.gguf	Q4_0	1.03GB
Llama3-1B-Base.IQ4_NL.gguf	IQ4_NL	1.03GB
Llama3-1B-Base.Q4_K_S.gguf	Q4_K_S	1.03GB
Llama3-1B-Base.Q4_K.gguf	Q4_K	1.04GB
Llama3-1B-Base.Q4_K_M.gguf	Q4_K_M	1.04GB
Llama3-1B-Base.Q4_1.gguf	Q4_1	1.1GB
Llama3-1B-Base.Q5_0.gguf	Q5_0	1.16GB
Llama3-1B-Base.Q5_K_S.gguf	Q5_K_S	1.16GB
Llama3-1B-Base.Q5_K.gguf	Q5_K	1.17GB
Llama3-1B-Base.Q5_K_M.gguf	Q5_K_M	1.17GB
Llama3-1B-Base.Q5_1.gguf	Q5_1	1.23GB
Llama3-1B-Base.Q6_K.gguf	Q6_K	1.31GB
Llama3-1B-Base.Q8_0.gguf	Q8_0	1.69GB

Original model description:

license: cc-by-4.0 language: - en pipeline_tag: text-generation

Llama-3-1B-Base

Llama3-1b is a trimmed version of the official Llama-3 8B base model from Meta. It has been reduced in size to ~1 billion parameters, making it more computationally efficient while still retaining a significant portion of the original model's capabilities. This model is intended to serve as a base model and has not been further fine-tuned for any specific task. It is specifically designed to bring the power of LLMs (Large Language Models) to environments with limited computational resources. This model offers a balance between performance and resource usage, serving as an efficient alternative for users who cannot leverage the larger, resource-intensive versions from Meta.

Important: This project is not affiliated with Meta.

Uses

This model can be fine-tuned for a variety of natural language processing tasks, including:

Text generation
Question answering
Sentiment analysis
Translation
Summarization

Bias, Risks, and Limitations

While Llama3-1b is a powerful model, it is important to be aware of its limitations and potential biases. As with any language model, this model may generate outputs that are factually incorrect or biased. It is also possible that the model may produce offensive or inappropriate content. Users and Developers should be aware of these risks and take appropriate measures to mitigate them.

How to Use

To use Llama3-1b, you can load the model using the Hugging Face Transformers library in Python:

from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("andrijdavid/Llama-3-1B-Base/")
model = AutoModelForCausalLM.from_pretrained("andrijdavid/Llama-3-1B-Base/")