---
license: other
---
# superhot-30b-8k-no-rlhf-test-GGML
**Note: LLAMA_ROPE_SCALE from PR [#1967](https://github.com/ggerganov/llama.cpp/pull/1967) needs to be set to 0.25**
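
The 0.25 comes from the interpolation ratio: the base model was trained with a 2048-token context, and 2048 / 8192 = 0.25, so positions are compressed four-fold to reach 8k. How the constant is applied is defined in the PR itself; as a hedged sketch only, later mainline llama.cpp exposes the same linear RoPE scaling at runtime as `--rope-freq-scale`, which would make an 8k run look roughly like this:

``` sh
# Sketch, not the PR #1967 mechanism: that PR sets LLAMA_ROPE_SCALE at build
# time, while mainline llama.cpp later exposed the same linear scaling at
# runtime as --rope-freq-scale.
./bin/main -m superhot-30b-8k-no-rlhf-test.ggmlv3.Q2_K.bin \
  -c 8192 --rope-freq-scale 0.25 \
  -p "Once upon a time"
```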
Merged base LLaMA and LoRA with this:
https://github.com/tloen/alpaca-lora
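
A hedged sketch of that merge step: the alpaca-lora repo ships an `export_hf_checkpoint.py` that loads the base weights, merges the LoRA into them, and writes out a full HF checkpoint. The base model is taken from the `BASE_MODEL` environment variable, while the LoRA path is set inside the script, so it has to be edited to point at the SuperHOT LoRA first.

``` sh
# Sketch only: edit the LoRA repo/path inside export_hf_checkpoint.py before
# running; the base model path here is a placeholder.
BASE_MODEL=path/to/llama-30b-hf python export_hf_checkpoint.py
```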

Converted and quantized with llama.cpp commit `447ccbe`:

``` sh
python convert.py superhot-30b-8k-safetensors --outtype f32 --outfile superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin
./bin/quantize superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin superhot-30b-8k-no-rlhf-test.ggmlv3.Q2_K.bin Q2_K
```
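
Only the Q2_K target is shown above; the same f32 intermediate can be re-quantized to any other type the commit supports (running `./bin/quantize` without arguments prints the list). For example:

``` sh
# Illustrative extra targets; any supported quant type works the same way.
./bin/quantize superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin superhot-30b-8k-no-rlhf-test.ggmlv3.Q4_K_M.bin Q4_K_M
./bin/quantize superhot-30b-8k-no-rlhf-test.ggmlv3.f32.bin superhot-30b-8k-no-rlhf-test.ggmlv3.Q5_K_M.bin Q5_K_M
```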