DavidAU
/

Psyfighter2-Ultra-Quality-13B-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

DavidAU commited on Jun 3, 2024

Commit

c4885c3

·

verified ·

1 Parent(s): 51f3229

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -26,6 +26,14 @@ Reduction in prompt size, as it understands nuance better.
 And as a side effect more context available for output due to reduction in prompt size.
 Special thanks to the original model creator:
 [ https://huggingface.co/KoboldAI/LLaMA2-13B-Psyfighter2 ]

 And as a side effect more context available for output due to reduction in prompt size.
+Note that there will be an outsized difference between quants especially for creative and/or "no right answer" use cases.
+Because of this it is suggested to download the highest quant you can operate, and it's closest neighbours so to speak.
+IE: Q4KS, Q4KM, Q5KS as an example.
+Imatrix Plus versions to be uploaded at a separate repo shortly.
 Special thanks to the original model creator:
 [ https://huggingface.co/KoboldAI/LLaMA2-13B-Psyfighter2 ]