SpiridonSunRotator
commited on
Commit
•
395071e
1
Parent(s):
51225fb
Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ tags:
|
|
11 |
Official [AQLM](https://arxiv.org/abs/2401.06118) quantization of [meta-llama/Meta-Llama-3.1-8B
|
12 |
](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) finetuned with [PV-Tuning](https://arxiv.org/abs/2405.14852).
|
13 |
|
14 |
-
For this quantization, we used 1 codebook of 16 bits and groupsize of
|
15 |
|
16 |
Results:
|
17 |
| Model | Quantization | MMLU (5-shot) | ArcC| ArcE| Hellaswag | PiQA | Winogrande | Model size, Gb |
|
|
|
11 |
Official [AQLM](https://arxiv.org/abs/2401.06118) quantization of [meta-llama/Meta-Llama-3.1-8B
|
12 |
](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) finetuned with [PV-Tuning](https://arxiv.org/abs/2405.14852).
|
13 |
|
14 |
+
For this quantization, we used 1 codebook of 16 bits and groupsize of 16.
|
15 |
|
16 |
Results:
|
17 |
| Model | Quantization | MMLU (5-shot) | ArcC| ArcE| Hellaswag | PiQA | Winogrande | Model size, Gb |
|