Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
datasets:
|
3 |
+
- adamo1139/basic_economics_questions_ts_test_1
|
4 |
+
---
|
5 |
+
QLORA on SpicyBoros 2.2 Llama 7B v2 using synthetic Q&A Dataset.
|
6 |
+
a little bit under one epoch, since my GTX1080 decided to OOM a tiny bit before training end and I am using checkpoint made at 450/465 step.
|
7 |
+
I've been running into a lot of issues, so I am happy to even get that far, most of my QLORA attempts had loss go to 0 and deepspeed was forcibly closing training after roughly 0.3 epoch.
|
8 |
+
|
9 |
+
My intention with this QLORA is mostly to try to train something usable and cool locally on normal desktop without going to runpod.
|
10 |
+
I tried training q4_0 quant with cpu-lora in llama.cpp (https://rentry.org/cpu-lora) but it's been a miss, it's about 20x slower on 11400f than on poorman's GTX 1080.
|
11 |
+
|
12 |
+
The model can be used to ask questions about basic economic concepts, responses will have a viewpoint similar to the one expressed by Thomas Sowell in his book Basic Economics.
|
13 |
+
|
14 |
+
Prompt format:
|
15 |
+
|
16 |
+
Reader: {prompt}
|
17 |
+
'\nThomas:\n' {response}
|