kewin4933
/

InferLLM-Model

Model card Files Files and versions Community

kewin4933 commited on Aug 16, 2023

Commit

2e86dee

•

1 Parent(s): 4430239

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -13,3 +13,5 @@ the two models also can be loaded by the [llama.cpp](https://github.com/ggergano
 InferLLM support the ChatGLM/ChatGLM2 model, the chatglm-q4/bin/chatglm2-q4.bin is the int4 quantized model from [chatglm-6b](https://huggingface.co/THUDM/chatglm-6b)/[chatglm2-6b](https://huggingface.co/THUDM/chatglm2-6b)
 InferLLM support the baichuan model, the baichuan-q4 is the int4 quantized model from [baichuan](https://huggingface.co/fireballoon/baichuan-vicuna-7b)

 InferLLM support the ChatGLM/ChatGLM2 model, the chatglm-q4/bin/chatglm2-q4.bin is the int4 quantized model from [chatglm-6b](https://huggingface.co/THUDM/chatglm-6b)/[chatglm2-6b](https://huggingface.co/THUDM/chatglm2-6b)
 InferLLM support the baichuan model, the baichuan-q4 is the int4 quantized model from [baichuan](https://huggingface.co/fireballoon/baichuan-vicuna-7b)
+InferLLM support the llama2 model, the llama2-q4 is the int4 quantized model from [llama2](https://huggingface.co/meta-llama/Llama-2-7b-chat-hf)