Disclaimer

These models are research experiments and may generate incorrect or harmful content. Outputs from these models should not be taken as factual or representative of the views of myself or the model's creator or any other individual.

The creator(s) of these models and I are not responsible for any harm or damage caused by the models' outputs.

I did not train these models or have any say in their creation; I merely converted them from the sources available below. To report issues or concerns, please contact the model maker via the links provided in this README.

Conversions

I have used llama.cpp to convert and quantize each of the models available in this repository. Currently, I have quantized:

Important information related to each model can be found in the links above.
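For readers who want to reproduce a conversion like the ones above, here is a minimal sketch of the usual two-step llama.cpp workflow: convert the Hugging Face checkpoint to an f16 GGUF file, then quantize it. It only builds and prints the command lines. The script name `convert_hf_to_gguf.py` and the binary name `llama-quantize` are assumptions based on recent llama.cpp checkouts (older checkouts used `convert-hf-to-gguf.py` and `quantize`). The directory names, output file names, and the `Q4_K_M` quantization type are hypothetical placeholders, not necessarily what was used for this repository.

```python
import shlex
from pathlib import Path


def build_commands(model_dir, out_dir, quant_type="Q4_K_M", llama_cpp_dir="llama.cpp"):
    """Return (convert, quantize) argument lists for a llama.cpp checkout.

    All paths and the default quant_type are illustrative assumptions.
    """
    repo = Path(llama_cpp_dir)
    out = Path(out_dir)
    f16_path = out / "model-f16.gguf"              # intermediate, unquantized GGUF
    quant_path = out / f"model-{quant_type}.gguf"  # final quantized GGUF

    # Step 1: convert the Hugging Face checkpoint to GGUF at f16 precision.
    convert = [
        "python", str(repo / "convert_hf_to_gguf.py"), str(model_dir),
        "--outfile", str(f16_path),
        "--outtype", "f16",
    ]
    # Step 2: quantize the f16 GGUF to the requested type.
    quantize = [str(repo / "llama-quantize"), str(f16_path), str(quant_path), quant_type]
    return convert, quantize


if __name__ == "__main__":
    # Print the commands instead of running them, so they can be reviewed first.
    for cmd in build_commands("Meta-Llama-3-8B-Instruct", "out"):
        print(shlex.join(cmd))
```

Printing rather than executing keeps the sketch safe to run anywhere; passing each list to `subprocess.run(cmd, check=True)` would execute it, provided the llama.cpp tools exist at those paths.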

Model Architecture Llama 3 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.

| | Training Data | Params | Context length | GQA | Token count | Knowledge cutoff |
| --- | --- | --- | --- | --- | --- | --- |
| Llama 3 | A new mix of publicly available online data. | 8B | 8k | Yes | 15T+ | March, 2023 |
| | | 70B | 8k | Yes | | December, 2023 |

Llama 3 family of models. Token counts refer to pretraining data only. Both the 8B and 70B versions use Grouped-Query Attention (GQA) for improved inference scalability.
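To give a rough sense of why GQA helps inference scalability, the sketch below estimates the key/value cache size at the full 8k context, with and without grouped KV heads. The shape values (32 layers, 32 query heads, 8 KV heads, head dimension 128) are assumptions taken from the publicly released 8B configuration, not from this README, and the cache is assumed to be stored in 16-bit precision.

```python
def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, bytes_per_value=2):
    """Approximate KV cache size for one sequence.

    The leading factor 2 accounts for storing both a key and a value
    tensor in every layer.
    """
    return 2 * n_layers * n_kv_heads * head_dim * context_len * bytes_per_value


# Assumed Llama 3 8B shape (from the published config, not from this README).
N_LAYERS = 32
N_QUERY_HEADS = 32
N_KV_HEADS = 8      # GQA: four query heads share each KV head
HEAD_DIM = 128
CONTEXT_LEN = 8192  # the "8k" context length from the table above

# Without GQA, every query head would need its own KV head.
mha_bytes = kv_cache_bytes(N_LAYERS, N_QUERY_HEADS, HEAD_DIM, CONTEXT_LEN)
gqa_bytes = kv_cache_bytes(N_LAYERS, N_KV_HEADS, HEAD_DIM, CONTEXT_LEN)

print(f"without GQA: {mha_bytes / 2**30:.1f} GiB")
print(f"with GQA:    {gqa_bytes / 2**30:.1f} GiB")
```

Under these assumptions the cache shrinks by the ratio of query heads to KV heads (4x here), which frees memory for longer contexts or larger batches.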

Model Release Date April 18, 2024.

License A custom commercial license is available at: https://llama.meta.com/llama3/license.

Intended Use

Intended Use Cases Llama 3 is intended for commercial and research use in English. Instruction-tuned models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks.

Out-of-scope Use in any manner that violates applicable laws or regulations (including trade compliance laws). Use in any other way that is prohibited by the Acceptable Use Policy and Llama 3 Community License. Use in languages other than English**.

**Note: Developers may fine-tune Llama 3 models for languages beyond English provided they comply with the Llama 3 Community License and the Acceptable Use Policy.
