lora
adamo1139 committed on
Commit
44c1835
1 Parent(s): a3c8315

Create README.md

  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
+ ---
+ datasets:
+ - adamo1139/basic_economics_questions_ts_test_1
+ ---
+ QLoRA fine-tune of SpicyBoros 2.2 Llama 7B v2 on a synthetic Q&A dataset.
+ Trained for slightly under one epoch: my GTX 1080 ran out of memory (OOM) shortly before training finished, so I am using the checkpoint saved at step 450 of 465.
+ I ran into a lot of issues, so I am happy to have gotten even this far. In most of my QLoRA attempts the loss went to 0 and DeepSpeed forcibly stopped training after roughly 0.3 epochs.
+
+ My intention with this QLoRA is mostly to train something usable and cool locally on a normal desktop, without resorting to RunPod.
+ I tried training a q4_0 quant with cpu-lora in llama.cpp (https://rentry.org/cpu-lora), but it was a miss: it is about 20x slower on an i5-11400F than on a poor man's GTX 1080.
+
+ The model can be asked questions about basic economic concepts; responses will reflect a viewpoint similar to the one Thomas Sowell expresses in his book Basic Economics.
+
+ Prompt format:
+
+ Reader: {prompt}
+ '\nThomas:\n' {response}
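
A minimal sketch of how the prompt format above might be assembled in Python. The helper names `build_prompt` and `extract_response` are hypothetical, and the code assumes the quoted `'\nThomas:\n'` is meant as a literal string containing real newlines; the README does not spell this out, so adjust if the model behaves differently.

```python
# Hypothetical helpers; the README only specifies the two speaker labels.
READER_LABEL = "Reader: "
# Assumption: the quoted '\nThomas:\n' in the README is a literal string
# with real newline characters on both sides of the label.
THOMAS_LABEL = "\nThomas:\n"


def build_prompt(question: str) -> str:
    """Format a question so the model continues with Thomas's answer."""
    return f"{READER_LABEL}{question.strip()}{THOMAS_LABEL}"


def extract_response(generated: str) -> str:
    """Return the text after the first 'Thomas:' label.

    If the model starts a new 'Reader:' turn, everything from that
    point on is discarded.
    """
    answer = generated.split("Thomas:", 1)[-1]
    answer = answer.split("\nReader:", 1)[0]
    return answer.strip()
```

Pass the string from `build_prompt` to whatever generation call you use, then run the full generated text through `extract_response` to keep only the first answer.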