dfurman committed
Commit f86318d
1 Parent(s): 1611920

Upload README.md

Files changed (1): README.md (+5 -9)
README.md CHANGED
@@ -3,12 +3,11 @@ pipeline_tag: text-generation
 license: other
 ---
 
-# 🚀 LLaMA-13B
+# 🦙 LLaMA-13B
 
 LLaMA-13B is a base model for text generation. It was built and released by Meta AI alongside "[LLaMA: Open and Efficient Foundation Language Models](https://arxiv.org/abs/2302.13971)".
 
-This model repo was converted to work with the Hugging Face transformers package. It is under a bespoke **non-commercial** license, please see the LICENSE file for more details.
-
+This model repo was converted to work with the transformers package. It is under a bespoke **non-commercial** license, please see the LICENSE file for more details.
 
 ## Model Summary
 
@@ -23,15 +22,13 @@ Questions and comments about LLaMA can be sent via the [GitHub repository](https
 
 ## Intended use
 **Primary intended uses**
-The primary use of LLaMA is research on large language models, including:
-exploring potential applications such as question answering, natural language understanding or reading comprehension, understanding capabilities and limitations of current language models, and developing techniques to improve those,
-evaluating and mitigating biases, risks, toxic and harmful content generations, hallucinations.
+The primary use of LLaMA is research on large language models, including: exploring potential applications such as question answering, natural language understanding or reading comprehension, understanding capabilities and limitations of current language models, and developing techniques to improve those, evaluating and mitigating biases, risks, toxic and harmful content generations, and hallucinations.
 
 **Primary intended users**
 The primary intended users of the model are researchers in natural language processing, machine learning and artificial intelligence.
 
 **Out-of-scope use cases**
-LLaMA is a foundation model (a base model). As such, it should not be used on downstream applications without further risk evaluation and mitigation. In particular, the model has not been trained with human feedback, and can thus generate toxic or offensive content, incorrect information or generally unhelpful answers.
+LLaMA is a base model, also known as a foundation model. As such, it should not be used on downstream applications without further risk evaluation, mitigation, and potential further fine-tuning. In particular, the model has not been trained with human feedback, and can thus generate toxic or offensive content, incorrect information or generally unhelpful answers.
 
 ## Factors
 **Relevant factors**
@@ -60,12 +57,11 @@ LLaMA is a foundational model, and as such, it should not be used for downstream
 
 ### Setup
 ```python
-# Install packages
 !pip install -q -U transformers accelerate torch
 ```
 ### GPU Inference in fp16
 
-This requires a GPU with at least xxGB of VRAM.
+This requires a GPU with at least 15GB of VRAM.
 
 ### First, Load the Model
 
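As a rough sanity check on the VRAM figure edited in the last hunk: fp16 stores each parameter in 2 bytes, so the weights of a 13B-parameter model alone occupy about 24 GiB. Fitting inference into a ~15 GB GPU budget would presumably rely on accelerate's CPU offloading (one reason `accelerate` appears in the install line), or on lower-precision weights. A minimal back-of-envelope sketch:

```python
# Back-of-envelope estimate of fp16 weight size (a sketch, not a measurement):
# 2 bytes per parameter, converted to GiB. Activation memory and KV cache are
# deliberately ignored here.
def fp16_weights_gib(n_params: float) -> float:
    """Approximate size of a model's fp16 weights in GiB."""
    return n_params * 2 / 1024**3

print(round(fp16_weights_gib(13e9), 1))  # prints 24.2
```

This is only the weight footprint; actual peak VRAM during generation is higher once activations and the KV cache are included.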