TheBloke/Tulu-30B-SuperHOT-8K-fp16

TheBloke committed on Jun 26, 2023

Commit 555d5ab • 1 Parent(s): 0d49c58

Update README.md

Files changed (1)

README.md +2 -1

README.md CHANGED

@@ -21,8 +21,9 @@ license: other
 These files are fp16 model files for [Allen AI's Tulu 30B](https://huggingface.co/allenai/tulu-30b) merged with [Kaio Ken's SuperHOT 30B 8K LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) to produce a model capable of 8K context.
-[Kaio Ken's SuperHOT 30B LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) is merged on to the base model to produce a model capable of 8K context, via a modified version of the Llama modelling code.
+[Kaio Ken's SuperHOT 30B LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test) is merged on to the base model to produce a model capable of 8K context, via the provided monkey patch (`llama_rope_scaled_monkey_patch.py`)
+Alternatively, `config.json` can be modified to allow the monkey patch to load via trust_remote_code=True. I plan to update this repo shortly to include that method.
 ## Repositories available
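
The added README lines name a monkey patch, `llama_rope_scaled_monkey_patch.py`, but this commit does not show its contents. As a rough illustration only, the sketch below shows the general idea the filename suggests: scaling rotary position indices so that an 8K context maps onto the position range the base model saw in training. The function name `scaled_rope_angles`, the scale factor of 0.25 (2048 / 8192), the base of 10000, and the head dimension of 128 are all assumptions made for this example, not values taken from the patch file.

```python
import math


def scaled_rope_angles(position, head_dim, scale=0.25, base=10000.0):
    """Return rotary angles for one token position.

    Hypothetical helper: the position index is multiplied by `scale`
    before the usual rotary frequencies are applied.
    """
    # Standard rotary inverse frequencies: base^(-2i/d) for i in [0, d/2).
    inv_freq = [base ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]
    # Compress the position index, then compute one angle per frequency.
    return [(position * scale) * f for f in inv_freq]


# With scale=0.25 (assumed), position 8192 gets the same angles that
# position 2048 gets without scaling.
angles_8k = scaled_rope_angles(8192, head_dim=128)
angles_2k = scaled_rope_angles(2048, head_dim=128, scale=1.0)
assert all(math.isclose(a, b) for a, b in zip(angles_8k, angles_2k))
```

In practice a patch of this kind would replace the rotary embedding inside the Llama modelling code before the model is loaded; the exact entry point should be checked in the patch file itself.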