Severian commited on
Commit
0550357
1 Parent(s): d6629d7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -12,13 +12,13 @@ At the very core of the development of this model is the desire to make sure tha
12
 
13
  Test this out and see if you find anything interesting or intriguing. I will keep iterating more versions but this one seems like a fun and useful way to start.
14
 
15
- ## Training
16
  ```
17
 
18
  key: str = "system", key2: str = "instruction"
19
 
20
  batch_size=1
21
- epochs=10 (I let it keep going until it finally converged)
22
  r=16
23
  lora_alpha=32
24
  lora_dropout=0.001
 
12
 
13
  Test this out and see if you find anything interesting or intriguing. I will keep iterating more versions but this one seems like a fun and useful way to start.
14
 
15
+ ## Training (Done on the First Draft V1 of the dataset)
16
  ```
17
 
18
  key: str = "system", key2: str = "instruction"
19
 
20
  batch_size=1
21
+ epochs=10 (Don't do this for the current version of the dataset, your model WILL overfit. It's very potent.)
22
  r=16
23
  lora_alpha=32
24
  lora_dropout=0.001