Add 4.0 link
Browse files
README.md
CHANGED
@@ -22,6 +22,8 @@ Conversion was done using wikitext-103-raw-v1-test.parquet as calibration datase
|
|
22 |
Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
|
23 |
|
24 |
Original model: https://huggingface.co/openaccess-ai-collective/DPOpenHermes-11B
|
|
|
|
|
25 |
|
26 |
<a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/6_5">6.5 bits per weight</a>
|
27 |
|
|
|
22 |
Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
|
23 |
|
24 |
Original model: https://huggingface.co/openaccess-ai-collective/DPOpenHermes-11B
|
25 |
+
|
26 |
+
<a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/4_0">4.0 bits per weight</a>
|
27 |
|
28 |
<a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/6_5">6.5 bits per weight</a>
|
29 |
|