samos123 commited on
Commit
406820d
1 Parent(s): 217fce2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -10
README.md CHANGED
@@ -1,14 +1,4 @@
1
  ---
2
- extra_gated_heading: Access Llama 2 on Hugging Face
3
- extra_gated_description: >-
4
- This is a form to enable access to Llama 2 on Hugging Face after you have been
5
- granted access from Meta. Please visit the [Meta website](https://ai.meta.com/resources/models-and-libraries/llama-downloads) and accept our
6
- license terms and acceptable use policy before submitting this form. Requests
7
- will be processed in 1-2 days.
8
- extra_gated_prompt: "**Your Hugging Face account email address MUST match the email you provide on the Meta website, or your request will not be approved.**"
9
- extra_gated_button_content: Submit
10
- extra_gated_fields:
11
- I agree to share my name, email address and username with Meta and confirm that I have already been granted download access on the Meta website: checkbox
12
  language:
13
  - en
14
  pipeline_tag: text-generation
@@ -21,6 +11,22 @@ tags:
21
  - llama
22
  - llama-2
23
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
24
  # **Llama 2**
25
  Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format. Links to other models can be found in the index at the bottom.
26
 
 
1
  ---
 
 
 
 
 
 
 
 
 
 
2
  language:
3
  - en
4
  pipeline_tag: text-generation
 
11
  - llama
12
  - llama-2
13
  ---
14
+ # Llama 2 7B Chat GGUF version
15
+
16
+ Files provided:
17
+
18
+ | File | Description |
19
+ | -------- | ------- |
20
+ | llama2-7b-chat-Q4_K_M.gguf | Quantised GGUF model using Q4_K_M |
21
+ | llama2-7b-chat-Q5_K_S.gguf | Quantised GGUF model using Q5_K_S |
22
+ | llama2-7b-chat-Q6_K.gguf | Quantised GGUF model using Q6_K |
23
+ | llama2-7b-chat-f16.gguf | Original float16 format which can be used for further quantisation |
24
+
25
+ Need help? Join the [Substratus discord server](https://discord.gg/JeXhcmjZVm).
26
+
27
+
28
+ Original README:
29
+
30
  # **Llama 2**
31
  Llama 2 is a collection of pretrained and fine-tuned generative text models ranging in scale from 7 billion to 70 billion parameters. This is the repository for the 7B fine-tuned model, optimized for dialogue use cases and converted for the Hugging Face Transformers format. Links to other models can be found in the index at the bottom.
32