File size: 539 Bytes
4bf0b78 fbf041c fdecbc9 a1ea854 fdecbc9 a1ea854 4bf0b78 fbf041c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 |
---
license: creativeml-openrail-m
tags:
- text generation
- conversational
- gptq
- 4bit
inference: false
language:
- en
pipeline_tag: text-generation
---
GPTQ quantization of https://huggingface.co/PygmalionAI/pygmalion-6b/commit/b8344bb4eb76a437797ad3b19420a13922aaabe1
Using this repository: https://github.com/mayaeary/GPTQ-for-LLaMa/tree/gptj-v2
Command:
```
python3 gptj.py models/pygmalion-6b_b8344bb4eb76a437797ad3b19420a13922aaabe1 c4 --wbits 4 --groupsize 128 --save_safetensors models/pygmalion-6b-4bit-128g.safetensors
``` |