amanpreetsingh459 commited on
Commit
33c304c
·
1 Parent(s): 5fdd3c2

Add instructions to run the model

Browse files
Files changed (1) hide show
  1. README.md +18 -4
README.md CHANGED
@@ -2,10 +2,24 @@
2
  license: mit
3
  ---
4
 
5
- # amanpreetsingh459/llama-2-7b-chat_q4_quantized_cpp
6
- - This model contains the 4-bit quantized version of [llama2](https://github.com/facebookresearch/llama) model.
7
- - This can be run on a local cpu system as a cpp module available at: [https://github.com/ggerganov/llama.cpp](https://github.com/ggerganov/llama.cpp)
8
- - As for the testing, the model has been tested on `Ubuntu Linux` os with `12 GB RAM` and `core i5 processor`.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
  # Credits:
11
  1. https://github.com/facebookresearch/llama
 
2
  license: mit
3
  ---
4
 
5
+ # llama-2-7b-chat_q4_quantized_cpp
6
+ - This model contains the 4-bit quantized version of [llama2](https://github.com/facebookresearch/llama) model in cpp.
7
+ - This can be run on a local cpu system as a cpp module *(instructions for the same are given below)*
8
+ - As for the testing, the model has been tested on `Linux(Ubuntu)` os with `12 GB RAM` and `core i5 processor`.
9
+
10
+ # Usage:
11
+ 1. Clone the llama C++ repository from github:<br>
12
+ `git clone https://github.com/ggerganov/llama.cpp.git`
13
+ 2. Enter the **llama.cpp** repository(which was downloaded in the step 1) and build it by running the **make** command<br>
14
+ `cd llama.cpp` <br>
15
+ `make`
16
+ 3. Create a directory names **7B** under the directory **llama.cpp/models** and put the model file **ggml-model-q4_0.bin** under this newly created **7B** directory<br>
17
+ `cd models` <br>
18
+ `mkdir 7B`
19
+ 4. Navigate back to **llama.cpp** directory and run the below command:-<br>
20
+ `./main -m ./models/7B/ggml-model-q4_0.bin -n 1024 --repeat_penalty 1.0 --color -i -r "User:" -f ./prompts/alpaca.txt` <br>
21
+ > the initial prompt file can be changed to anything from `prompts/alpaca.txt` to of your choice
22
+ 5. That's it. Enter the desired prompts and let the results surprise you...
23
 
24
  # Credits:
25
  1. https://github.com/facebookresearch/llama