ccore committed on
Commit 6da04b8 · 1 Parent(s): da588b8

Update README.md

  1. README.md +25 -1
README.md CHANGED
@@ -62,6 +62,30 @@ in my last tests with length 2048, I got great models, I trained models in 24 ho
  In training, step 2/6
  Each stage lasts 4-6 hours.
  I am releasing the partial models, in the end I will also release the datasets. 100% synthetic data in markdown
- 1 - OK
+ 1 - 2.5h OK result :
+ (if you have problems on eval, set same max_length)
+ |   Task   |Version|Metric|Value |   |Stderr|
+ |----------|------:|------|-----:|---|-----:|
+ |winogrande|      0|acc   |0.5162|±  | 0.014|
+
+ hf-causal (max_length=3200), limit: None, provide_description: False, num_fewshot: 0, batch_size: None
+ |   Task   |Version| Metric |Value |   |Stderr|
+ |----------|------:|--------|-----:|---|-----:|
+ |openbookqa|      0|acc     |0.1380|±  |0.0154|
+ |          |       |acc_norm|0.3420|±  |0.0212|
+ |piqa      |      0|acc     |0.6289|±  |0.0113|
+ |          |       |acc_norm|0.6251|±  |0.0113|
+
+ hf-causal (max_length=1280), limit: None, provide_description: False, num_fewshot: 0, batch_size: None
+ |    Task     |Version| Metric |Value |   |Stderr|
+ |-------------|------:|--------|-----:|---|-----:|
+ |arc_challenge|      0|acc     |0.1903|±  |0.0115|
+ |             |       |acc_norm|0.2270|±  |0.0122|
+ |hellaswag    |      0|acc     |0.2892|±  |0.0045|
+ |             |       |acc_norm|0.3114|±  |0.0046|
+
+
+
+
  2 - RUNNING - next upload 9/9 - 00:30 GMT
  3 -
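
Reading note on the stage 1 numbers: the Value and Stderr columns can be compared against random guessing with a quick calculation. The sketch below is not part of the commit. It copies the reported values from the tables, and it assumes chance levels of 0.5 for the two-choice tasks (winogrande, piqa) and 0.25 for the four-choice tasks (openbookqa, arc_challenge, hellaswag), plus a normal approximation with a two-sided 1.96 cutoff. The names `RESULTS`, `CHANCE`, and `summarize` are hypothetical.

```python
# Reported zero-shot results copied from the README tables:
# (task, metric) -> (value, stderr)
RESULTS = {
    ("winogrande", "acc"): (0.5162, 0.014),
    ("openbookqa", "acc"): (0.1380, 0.0154),
    ("openbookqa", "acc_norm"): (0.3420, 0.0212),
    ("piqa", "acc"): (0.6289, 0.0113),
    ("piqa", "acc_norm"): (0.6251, 0.0113),
    ("arc_challenge", "acc"): (0.1903, 0.0115),
    ("arc_challenge", "acc_norm"): (0.2270, 0.0122),
    ("hellaswag", "acc"): (0.2892, 0.0045),
    ("hellaswag", "acc_norm"): (0.3114, 0.0046),
}

# Assumption: random-guess accuracy derived from the number of answer choices.
CHANCE = {
    "winogrande": 0.5,
    "piqa": 0.5,
    "openbookqa": 0.25,
    "arc_challenge": 0.25,
    "hellaswag": 0.25,
}


def summarize(results, chance, z_crit=1.96):
    """Return {(task, metric): (z, differs_from_chance)}.

    z is the distance from chance in units of the reported standard error;
    differs_from_chance is True when |z| exceeds z_crit (normal approximation).
    """
    summary = {}
    for (task, metric), (value, stderr) in results.items():
        z = (value - chance[task]) / stderr
        summary[(task, metric)] = (round(z, 2), abs(z) > z_crit)
    return summary


if __name__ == "__main__":
    for (task, metric), (z, differs) in summarize(RESULTS, CHANCE).items():
        print(f"{task:14s} {metric:9s} z={z:7.2f} differs_from_chance={differs}")
```

Under these assumptions, winogrande acc and arc_challenge acc_norm stay within about two standard errors of chance, while the piqa and hellaswag scores sit well above it.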