flashvenom commited on
Commit
2a9bc26
1 Parent(s): daa1827

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -1,3 +1,3 @@
1
- Model upload coming soon, source model from https://huggingface.co/Peeepy/Airoboros-13b-SuperHOT-8k.
2
 
3
  You will need a monkey-patch at inference to use the 8k context, please see patch file present, if you are using a different inference engine (like llama.cpp / exllama) you will need to add the monkey patch there.
 
1
+ Model upload in 4-bit GPTQ version, converted using GPTQ-for-LLaMa; Source model from https://huggingface.co/Peeepy/Airoboros-13b-SuperHOT-8k.
2
 
3
  You will need a monkey-patch at inference to use the 8k context, please see patch file present, if you are using a different inference engine (like llama.cpp / exllama) you will need to add the monkey patch there.