Update README.md
README.md CHANGED
@@ -26,7 +26,11 @@ You need the [autoGPTQ](https://github.com/PanQiWei/AutoGPTQ) library to use this model.
 
 ## Other Quantized Model
 
-
+### New!
+[dahara1/ELYZA-japanese-Llama-2-7b-instruct-AWQ](https://huggingface.co/dahara1/ELYZA-japanese-Llama-2-7b-instruct-AWQ) is newly published.
+The AWQ model has an improved ability to follow instructions, so please try it.
+
+There are two other quantized models in the [llama.cpp](https://github.com/ggerganov/llama.cpp) format.
 If you want to run it in a CPU-only environment, you may want to check these.
 
 (1) [mmnga's gguf version](https://huggingface.co/mmnga/ELYZA-japanese-Llama-2-7b-fast-instruct-gguf)