Add 8.0 link
Browse files
README.md
CHANGED
@@ -27,6 +27,8 @@ Conversion was done using the default calibration dataset.
|
|
27 |
Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
|
28 |
|
29 |
Original model: https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v3
|
|
|
|
|
30 |
|
31 |
## Download instructions
|
32 |
|
|
|
27 |
Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
|
28 |
|
29 |
Original model: https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v3
|
30 |
+
|
31 |
+
<a href="https://huggingface.co/bartowski/Llama-2-7b-chat-hf-function-calling-v3-exl2/tree/8_0">8.0 bits per weight</a>
|
32 |
|
33 |
## Download instructions
|
34 |
|