ludis
/

tsukasa-13b-qlora-limarp-gptq

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ludis commited on Nov 28, 2023

Commit

81175e3

•

1 Parent(s): 7967efa

Update README.md

Files changed (1) hide show

README.md +12 -12

README.md CHANGED Viewed

@@ -1,27 +1,27 @@
 ---
 datasets:
   - PygmalionAI/PIPPA
-  - ludis/geepeetee4
 ---
-just testing for now, qlora merge, several things different between this and the 7b
-## training
-NousResearch/Llama-2-13b-hf tuned on koishi data (without code subsets) for 1 epoch
-then tuned on pippa for 1 epoch
-then tuned on ludis/geepeetee4 commit 58aabc8 for 1 epoch
-then tuned on limarp (without ponyville, lolicit, all the fallen, and eka's portal subsets) Version 2023-09-08 for 2 epochs
-all metharme format
-## prompting
-https://rentry.org/tsukasa13b - reccomended prompts and gen settings
-The current model version has been trained on prompts using three different roles, which are denoted by the following tokens: `<|system|>`, `<|user|>` and `<|model|>`.
-The `<|system|>` prompt can be used to inject out-of-channel information behind the scenes, while the `<|user|>` prompt should be used to indicate user input. The `<|model|>` token should then be used to indicate that the model should generate a response. These tokens can happen multiple times and be chained up to form a conversation history.

 ---
 datasets:
   - PygmalionAI/PIPPA
 ---
+## GPTQ
+gptq quants for ludis/tsukasa-13b-qlora-limarp download the original model except for the .bin files (or download everything and delete the .bin files) then move the contents from whichever quants folder you want to use into the original model folder and run with autogptq
+## Prompting
+https://rentry.org/v43eo - reccomended prompts and gen settings
+The current model version has been trained on prompts using three different roles, which are denoted by the following tokens: `<|system|>`, `<|user|>` and `<|model|>`.
+The `<|system|>` prompt can be used to inject out-of-channel information behind the scenes, while the `<|user|>` prompt should be used to indicate user input. The `<|model|>` token should then be used to indicate that the model should generate a response. These tokens can happen multiple times and be chained up to form a conversation history.
+## Training
+base model (llama-2-13b-hf)
+then tuned on pippa for 1 epoch
+then tuned on ludis/geepeetee4 commit 58aabc8 for 1 epoch
+then tuned on limarp (without ponyville, lolicit, all the fallen, and eka's portal subsets) Version 2023-09-08 for 2 epochs