gate369 commited on
Commit
4756708
1 Parent(s): 33e0fc2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -15,14 +15,15 @@ datasets:
15
  - QSTAR
16
  - I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
17
  be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
18
- -Steps: -Loss:
19
- - 356 - 0.014300
20
- - 357 - 0.012400
21
- - 358 - 0.016800
22
- - 359 - 0.022200
23
- - 360 - 0.015000
24
- - 361 - 0.018300
25
- - 362 - 0.016000
 
26
  - 363 - 0.019000
27
  - 364 - 0.017600
28
  - 365 - 0.015600
 
15
  - QSTAR
16
  - I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
17
  be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
18
+
19
+ - Steps: -Loss:
20
+ - 1 - 1.373000
21
+ - 2 - 1.551400
22
+ - 3 - 1.083100
23
+ - 4 - 1.164900
24
+ - 5 - 1.196500
25
+ - 6 - 1.015400
26
+ - ....
27
  - 363 - 0.019000
28
  - 364 - 0.017600
29
  - 365 - 0.015600