Doctor-Shotgun committed: Update README.md
Commit: 327ab26
Parent: 8e8da0d

README.md CHANGED
@@ -97,11 +97,11 @@ special_tokens:
 
 # limarp-miqu-1-70b-qlora
 
-Experimental limarp qlora trained at 16384 ctx length (greater than size of the longest limarp sample when tokenized via
+Experimental LimaRP QLoRA trained at 16384 ctx length (greater than the size of the longest LimaRP sample when tokenized via Llama's tokenizer) on the fixed dequantized miqu-1-70b model by 152334H.
 
 I wasn't particularly happy with the results I got when I tried applying the LoRA at varying weights to the miqu-1-70b model. It's possible that this is because the model was dequantized from a Q5_K_M GGUF, or because it is already an instruct-tuned model.
 
-However, I decided to go ahead and release this in case someone else finds a use for it.
+However, I decided to go ahead and release this in case someone else finds a use for it. Provided as-is; YMMV.
 
 ## Model description
 
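The updated description claims the 16384 ctx length exceeds the longest LimaRP sample under Llama's tokenizer. Below is a minimal sketch of how such a check could be run; the tokenizer repo id and the `load_formatted_samples()` helper are illustrative assumptions, not part of the actual training pipeline:

```python
# Illustrative sanity check: confirm every formatted training sample fits
# within the 16384-token context when tokenized with a Llama tokenizer.
from transformers import AutoTokenizer

# Assumed repo id; any Llama-family tokenizer gives the same length check.
tokenizer = AutoTokenizer.from_pretrained("152334H/miqu-1-70b-sf")

# Hypothetical helper: returns the fully formatted samples as strings.
samples = load_formatted_samples()

longest = max(len(tokenizer.encode(text)) for text in samples)
print(f"longest sample: {longest} tokens")
assert longest <= 16384, "a sample would be truncated at the training ctx length"
```

If the assertion holds, no sample is truncated during training at this ctx length, which is the property the description is asserting.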
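On "applying the LoRA at varying weights": with the PEFT library, one way to apply an adapter at a scaled-down weight is to re-register it via `add_weighted_adapter` and then merge. A minimal sketch, assuming the Hugging Face repo ids shown and an illustrative 0.5 weight; this is not necessarily the author's exact procedure:

```python
# Illustrative sketch: merge the QLoRA adapter into the base model at half weight.
import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

# Assumed repo ids; loading a 70B model in fp16 needs substantial memory.
base = AutoModelForCausalLM.from_pretrained(
    "152334H/miqu-1-70b-sf",
    torch_dtype=torch.float16,
    device_map="auto",
)
model = PeftModel.from_pretrained(base, "Doctor-Shotgun/limarp-miqu-1-70b-qlora")

# Re-register the adapter at 0.5 strength, activate it, and merge it
# into the base weights.
model.add_weighted_adapter(
    adapters=["default"],
    weights=[0.5],
    adapter_name="limarp_half",
    combination_type="linear",
)
model.set_adapter("limarp_half")
merged = model.merge_and_unload()
merged.save_pretrained("./miqu-limarp-w0.5")
```

At a weight of 1.0 this reduces to a plain `merge_and_unload()`; sweeping the weight value is how the varying-weight trials described above could be reproduced.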