Update README.md
Browse files
README.md
CHANGED
@@ -39,7 +39,11 @@ Use Llama 3 template, model with ALpaca template sometimes halucinates and gener
|
|
39 |
|
40 |
### Quants
|
41 |
|
42 |
-
More quants are comming soon... and I need to redo
|
|
|
|
|
|
|
|
|
|
|
43 |
|
44 |
-
- [GGUF](https://huggingface.co/altomek/RE-70B-AS3D-GGUF)
|
45 |
|
|
|
39 |
|
40 |
### Quants
|
41 |
|
42 |
+
More quants are comming soon... and I need to redo GGUF quants(!) as they do not encode chat template in tokenizer_config... if your inference engine uses chat template from GGUF file you will se halucinations. However they work fine for me in SillyTawern...
|
43 |
+
|
44 |
+
- [GGUF](https://huggingface.co/altomek/RE-70B-AS3D-GGUF) --> TO BE UPDATED!
|
45 |
+
- [3.5 BPW](https://huggingface.co/altomek/RE-70B-AS3D-3.5bpw-EXL2)
|
46 |
+
- [3.75 BPW](https://huggingface.co/altomek/RE-70B-AS3D-3.75bpw-EXL2)
|
47 |
+
- [4 BPW](https://huggingface.co/altomek/RE-70B-AS3D-4bpw-EXL2)
|
48 |
|
|
|
49 |
|