Download the Llama 2 Model:
Download the Llama 2 model file named llama-2-7b-chat.ggmlv3.q4_0.bin
from the following link:
Llama 2 Model Information
Name | Quant method | Bits | Size | Max RAM required |
---|---|---|---|---|
llama-2-7b-chat.ggmlv3.q4_0.bin | q4_0 | 4 | 3.79 GB | 6.29 GB |
Note: After downloading the model, place the file in the models
directory. The code expects to find it at
models\llama-2-7b-chat.ggmlv3.q4_0.bin and will not run without it.
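
To confirm the file landed in the right place before running the code, a quick check along these lines may help. This is a minimal sketch, not part of the project: the `find_model` helper is a hypothetical name, and it assumes the script is run from the project root so that the relative `models` directory resolves correctly.

```python
from pathlib import Path

# File name taken from the table above.
MODEL_NAME = "llama-2-7b-chat.ggmlv3.q4_0.bin"


def find_model(models_dir="models"):
    """Return the model path if the file exists in models_dir, else None.

    Hypothetical helper for illustration; the project itself may locate
    the model differently.
    """
    path = Path(models_dir) / MODEL_NAME
    return path if path.is_file() else None


if __name__ == "__main__":
    path = find_model()
    if path is None:
        print(f"Model not found; expected it at {Path('models') / MODEL_NAME}")
    else:
        # Report the size so it can be compared by eye with the table above.
        size_gb = path.stat().st_size / 1e9
        print(f"Found {path} ({size_gb:.2f} GB)")
```

If the reported size is far below the size listed in the table, the download was probably interrupted and should be repeated.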