MohamedRashad committed • Commit 02856c9 • Parent(s): 8491878

Update README.md
This repo contains AWQ model files for [FreedomIntelligence's AceGPT 7B Chat](https://huggingface.co/FreedomIntelligence/AceGPT-7B-chat).

In my effort to make Arabic LLMs available to consumers with simple GPUs, I have quantized two important models:

- [AceGPT 13B Chat AWQ](https://huggingface.co/MohamedRashad/AceGPT-13B-chat-AWQ)
- [AceGPT 7B Chat AWQ](https://huggingface.co/MohamedRashad/AceGPT-7B-chat-AWQ) **(We are here)**

### About AWQ
AWQ is an efficient, accurate, and fast low-bit weight quantization method, currently supporting 4-bit quantization. It offers faster Transformers-based inference than GPTQ, with quality equivalent to or better than the most commonly used GPTQ settings.
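As a rough illustration of what 4-bit weight quantization means, here is a simplified absmax round-trip sketch in plain Python. This is *not* AWQ's actual algorithm (AWQ additionally chooses per-channel scales from activation statistics to protect salient weights), and the example weights are made up:

```python
# Simplified sketch of 4-bit weight quantization (absmax scaling).
# NOT AWQ itself: real AWQ picks scales using activation statistics.

def quantize_4bit(weights, scale):
    """Map float weights to 4-bit signed integers in [-8, 7] with one shared scale."""
    return [max(-8, min(7, round(w / scale))) for w in weights]

def dequantize(quantized, scale):
    """Recover approximate float weights from the 4-bit integers."""
    return [q * scale for q in quantized]

# Hypothetical weights for illustration only.
weights = [0.12, -0.5, 0.33, 0.07]
scale = max(abs(w) for w in weights) / 7  # absmax: largest weight maps to +/-7

q = quantize_4bit(weights, scale)        # -> [2, -7, 5, 1]
approx = dequantize(q, scale)            # close to the originals, stored in 4 bits
```

The point of methods like AWQ is to choose the scales so that this rounding error hurts model quality as little as possible, while the weights shrink from 16 bits to 4 bits each.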