Update README.md
README.md
CHANGED
---
base_model: google/gemma-2-9b-it
inference: false
license: apache-2.0
model_name: Gemma-2-9B-Instruct-4Bit-GPTQ
pipeline_tag: text-generation
quantized_by: Granther
tags:
- gptq
---

# Gemma-2-9B-Instruct-4Bit-GPTQ

- Original Model: [gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it)
- Model Creator: [google](https://huggingface.co/google)

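This card does not include a usage snippet, so the following is only a minimal sketch for loading the checkpoint with Transformers. The repository id is assumed from the model name and the `quantized_by` field above, and loading GPTQ weights this way requires the `optimum` and `auto-gptq` packages to be installed.

```python
# Minimal usage sketch; the repo id below is assumed from the model name and
# "quantized_by" fields and may differ from the actual repository path.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Granther/Gemma-2-9B-Instruct-4Bit-GPTQ"  # assumed

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize what GPTQ quantization does."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```
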
## Quantization

- This model was quantized to 4-bit with the Auto-GPTQ library (a minimal reproduction sketch follows below).

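The exact quantization settings are not listed in this card. The snippet below is a minimal sketch, assuming quantization through Transformers' `GPTQConfig`, which drives Auto-GPTQ under the hood; the calibration dataset, group size, and output directory are assumptions, not the recipe actually used.

```python
# Sketch only: calibration dataset, group size, and output path are assumptions,
# not the settings actually used for this repository.
from transformers import AutoModelForCausalLM, AutoTokenizer, GPTQConfig

base_id = "google/gemma-2-9b-it"
tokenizer = AutoTokenizer.from_pretrained(base_id)

quant_config = GPTQConfig(
    bits=4,            # 4-bit weights
    group_size=128,    # assumed group size
    dataset="c4",      # assumed calibration dataset
    tokenizer=tokenizer,
)

# Passing a GPTQConfig runs calibration and packs the 4-bit weights during loading.
model = AutoModelForCausalLM.from_pretrained(
    base_id,
    quantization_config=quant_config,
    device_map="auto",
)

model.save_pretrained("Gemma-2-9B-Instruct-4Bit-GPTQ")
tokenizer.save_pretrained("Gemma-2-9B-Instruct-4Bit-GPTQ")
```

`save_pretrained` then writes the quantized checkpoint and tokenizer that can be pushed to the Hub.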

## Metrics

| Benchmark    | Metric | Gemma 2 GPTQ | Gemma 2 9B it |
| ------------ | ------ | ------------ | ------------- |
| [PIQA](piqa) | 0-shot | 80.52        | 80.79         |
| [MMLU](mmlu) | 5-shot | 52.0         | 50.00         |
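
The card does not state how these numbers were produced. Below is a hedged sketch using the lm-evaluation-harness Python API (`lm_eval.simple_evaluate`); the repository id and harness version are assumptions, and each task is run with the few-shot setting from the table above.

```python
# Sketch only: assumes lm-evaluation-harness >= 0.4 and that the quantized model
# lives at the (assumed) repo id below.
import lm_eval

repo_id = "Granther/Gemma-2-9B-Instruct-4Bit-GPTQ"  # assumed

# PIQA at 0-shot and MMLU at 5-shot, matching the table above.
for task, shots in [("piqa", 0), ("mmlu", 5)]:
    results = lm_eval.simple_evaluate(
        model="hf",
        model_args=f"pretrained={repo_id}",
        tasks=[task],
        num_fewshot=shots,
    )
    print(task, results["results"])
```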