ludis commited on
Commit
81175e3
1 Parent(s): 7967efa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -1,27 +1,27 @@
1
  ---
2
  datasets:
3
  - PygmalionAI/PIPPA
4
- - ludis/geepeetee4
5
  ---
6
 
7
- just testing for now, qlora merge, several things different between this and the 7b
8
 
9
- ## training
10
 
11
- NousResearch/Llama-2-13b-hf tuned on koishi data (without code subsets) for 1 epoch
12
 
13
- then tuned on pippa for 1 epoch
14
 
15
- then tuned on ludis/geepeetee4 commit 58aabc8 for 1 epoch
16
 
17
- then tuned on limarp (without ponyville, lolicit, all the fallen, and eka's portal subsets) Version 2023-09-08 for 2 epochs
18
 
19
- all metharme format
20
 
21
- ## prompting
22
 
23
- https://rentry.org/tsukasa13b - reccomended prompts and gen settings
24
 
25
- The current model version has been trained on prompts using three different roles, which are denoted by the following tokens: `<|system|>`, `<|user|>` and `<|model|>`.
 
 
26
 
27
- The `<|system|>` prompt can be used to inject out-of-channel information behind the scenes, while the `<|user|>` prompt should be used to indicate user input. The `<|model|>` token should then be used to indicate that the model should generate a response. These tokens can happen multiple times and be chained up to form a conversation history.
 
1
  ---
2
  datasets:
3
  - PygmalionAI/PIPPA
 
4
  ---
5
 
6
+ ## GPTQ
7
 
8
+ gptq quants for ludis/tsukasa-13b-qlora-limarp download the original model except for the .bin files (or download everything and delete the .bin files) then move the contents from whichever quants folder you want to use into the original model folder and run with autogptq
9
 
10
+ ## Prompting
11
 
12
+ https://rentry.org/v43eo - reccomended prompts and gen settings
13
 
14
+ The current model version has been trained on prompts using three different roles, which are denoted by the following tokens: `<|system|>`, `<|user|>` and `<|model|>`.
15
 
16
+ The `<|system|>` prompt can be used to inject out-of-channel information behind the scenes, while the `<|user|>` prompt should be used to indicate user input. The `<|model|>` token should then be used to indicate that the model should generate a response. These tokens can happen multiple times and be chained up to form a conversation history.
17
 
18
+ ## Training
19
 
20
+ base model (llama-2-13b-hf)
21
 
22
+ then tuned on pippa for 1 epoch
23
 
24
+ then tuned on ludis/geepeetee4 commit 58aabc8 for 1 epoch
25
+
26
+ then tuned on limarp (without ponyville, lolicit, all the fallen, and eka's portal subsets) Version 2023-09-08 for 2 epochs
27