Text Generation
Transformers
English
Inference Endpoints
bartowski commited on
Commit
4523e93
1 Parent(s): 0dcca9b

Add 4.0 link

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -22,6 +22,8 @@ Conversion was done using wikitext-103-raw-v1-test.parquet as calibration datase
22
  Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
23
 
24
  Original model: https://huggingface.co/openaccess-ai-collective/DPOpenHermes-11B
 
 
25
 
26
  <a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/6_5">6.5 bits per weight</a>
27
 
 
22
  Default arguments used except when the bits per weight is above 6.0, at that point the lm_head layer is quantized at 8 bits per weight instead of the default 6.
23
 
24
  Original model: https://huggingface.co/openaccess-ai-collective/DPOpenHermes-11B
25
+
26
+ <a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/4_0">4.0 bits per weight</a>
27
 
28
  <a href="https://huggingface.co/bartowski/DPOpenHermes-11B-exl2/tree/6_5">6.5 bits per weight</a>
29