teknium committed on
Commit b7c3ec5
1 Parent(s): 89668df

Update README.md

Files changed (1)
  1. README.md +20 -0
README.md CHANGED
@@ -72,6 +72,24 @@ or
72
  <leave a newline blank for model to respond>
73
  ```
74

75
  BigBench:
76
  ```
77
  | Task |Version| Metric |Value | |Stderr|
@@ -95,6 +113,7 @@ BigBench:
95
  |bigbench_tracking_shuffled_objects_five_objects | 0|multiple_choice_grade|0.1944|± |0.0112|
96
  |bigbench_tracking_shuffled_objects_seven_objects| 0|multiple_choice_grade|0.1497|± |0.0085|
97
  |bigbench_tracking_shuffled_objects_three_objects| 0|multiple_choice_grade|0.4067|± |0.0284|
 
98
  ```
99
 
100
  AGIEval
@@ -117,6 +136,7 @@ AGIEval
117
  | | |acc_norm|0.3447|± |0.0332|
118
  |agieval_sat_math | 0|acc |0.2500|± |0.0293|
119
  | | |acc_norm|0.2364|± |0.0287|
 
120
  ```
121
 
122
  ## Benchmark Results
 
72
  <leave a newline blank for model to respond>
73
  ```
74
 
75
+ GPT4All:
76
+ ```| Task |Version| Metric |Value | |Stderr|
77
+ |-------------|------:|--------|-----:|---|-----:|
78
+ |arc_challenge| 0|acc |0.4735|± |0.0146|
79
+ | | |acc_norm|0.5017|± |0.0146|
80
+ |arc_easy | 0|acc |0.7946|± |0.0083|
81
+ | | |acc_norm|0.7605|± |0.0088|
82
+ |boolq | 1|acc |0.8000|± |0.0070|
83
+ |hellaswag | 0|acc |0.5924|± |0.0049|
84
+ | | |acc_norm|0.7774|± |0.0042|
85
+ |openbookqa | 0|acc |0.3600|± |0.0215|
86
+ | | |acc_norm|0.4660|± |0.0223|
87
+ |piqa | 0|acc |0.7889|± |0.0095|
88
+ | | |acc_norm|0.7976|± |0.0094|
89
+ |winogrande | 0|acc |0.6993|± |0.0129|
90
+ Average: 0.686
91
+ ```
92
+
93
  BigBench:
94
  ```
95
  | Task |Version| Metric |Value | |Stderr|
 
113
  |bigbench_tracking_shuffled_objects_five_objects | 0|multiple_choice_grade|0.1944|± |0.0112|
114
  |bigbench_tracking_shuffled_objects_seven_objects| 0|multiple_choice_grade|0.1497|± |0.0085|
115
  |bigbench_tracking_shuffled_objects_three_objects| 0|multiple_choice_grade|0.4067|± |0.0284|
116
+ Average: 0.3525
117
  ```
118
 
119
  AGIEval
 
136
  | | |acc_norm|0.3447|± |0.0332|
137
  |agieval_sat_math | 0|acc |0.2500|± |0.0293|
138
  | | |acc_norm|0.2364|± |0.0287|
139
+ Average: 0.2975
140
  ```
141
 
142
  ## Benchmark Results
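
Note on the added `Average: 0.686` line: the commit does not say how the GPT4All average is computed. The sketch below assumes a common convention, which is to take `acc_norm` for a task when it is reported and fall back to `acc` otherwise, then take the unweighted mean over the seven tasks. The names `gpt4all_scores`, `headline_score`, and `suite_average` are hypothetical and used only for illustration; the values are copied from the table in the diff.

```python
# Scores copied from the GPT4All table added in this commit.
# Each task maps a metric name to its reported value.
gpt4all_scores = {
    "arc_challenge": {"acc": 0.4735, "acc_norm": 0.5017},
    "arc_easy": {"acc": 0.7946, "acc_norm": 0.7605},
    "boolq": {"acc": 0.8000},
    "hellaswag": {"acc": 0.5924, "acc_norm": 0.7774},
    "openbookqa": {"acc": 0.3600, "acc_norm": 0.4660},
    "piqa": {"acc": 0.7889, "acc_norm": 0.7976},
    "winogrande": {"acc": 0.6993},
}


def headline_score(metrics):
    """Assumed rule (not stated in the commit): acc_norm if present, else acc."""
    return metrics.get("acc_norm", metrics["acc"])


def suite_average(scores):
    """Unweighted mean of one headline score per task."""
    values = [headline_score(m) for m in scores.values()]
    return sum(values) / len(values)


average = suite_average(gpt4all_scores)
print(f"Average: {average:.3f}")  # prints "Average: 0.686"
```

Under this assumption the result matches the value in the diff. Averaging plain `acc` for every task would give roughly 0.644 instead, which suggests the `acc_norm`-first rule is the one in use here, although the commit itself does not confirm it. The BigBench and AGIEval tables are only partially visible in this diff, so their averages are not recomputed.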