shaowenchen committed on
Commit a172f59 · 1 Parent(s): 33b5f22

Update README.md

  1. README.md +8 -0
README.md CHANGED
@@ -41,6 +41,14 @@ tags:
  | llama-2-7b-langchain-chat.Q8_0.gguf | Q8_0 | 6.7 GB |
  | llama-2-7b-langchain-chat.gguf | full | 13 GB |
  
+ Usage:
+
+ ```
+ docker run --rm -it -p 8000:8000 -v /path/to/models:/models -e MODEL=/models/gguf-model-name.gguf hubimage/llama-cpp-python:latest
+ ```
+
+ and you can view http://localhost:8000/docs to see the swagger UI.
+
  ## Provided images
  
  | Name | Quant method | Size |
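
Once the container from the added Usage snippet is running, the server can also be queried from code instead of the Swagger UI. The following is a minimal sketch, assuming the image serves llama-cpp-python's OpenAI-compatible API and that a `/v1/completions` route is reachable on the mapped port 8000. The helper names `build_completion_request` and `complete` are hypothetical, and the routes your image actually exposes are listed at http://localhost:8000/docs.

```python
import json
import urllib.request

# Assumption: the container started with the `docker run` command from the
# Usage section publishes the server on localhost:8000.
BASE_URL = "http://localhost:8000"


def build_completion_request(prompt, max_tokens=64, base_url=BASE_URL):
    """Build a POST request for the (assumed) OpenAI-style /v1/completions route."""
    payload = {"prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, max_tokens=64, base_url=BASE_URL, timeout=120):
    """Send the prompt to the server and return the generated text."""
    request = build_completion_request(prompt, max_tokens, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # Assumption: the response follows the OpenAI completions schema,
    # where the generated text is at choices[0].text.
    return body["choices"][0]["text"]


if __name__ == "__main__":
    # Requires the container to be running; otherwise urlopen raises URLError.
    print(complete("Q: What is LangChain? A:"))
```

Only the standard library is used, so the sketch runs without extra dependencies. The long default timeout is a guess to allow for slow CPU inference on larger quantizations.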