Update README.md
README.md CHANGED
@@ -49,6 +49,7 @@ print(result)
 
 ## lm-eval benchmark:
 
+```
 | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr|
 |---------------------------------------|------:|------|-----:|----------|---|-----:|---|-----:|
 |arc_challenge | 1|none | 0|acc |↑ |0.6186|± |0.0142|
@@ -136,4 +137,5 @@ print(result)
 | - humanities | 1|none | |acc |↑ |0.7981|± |0.0057|
 | - other | 1|none | |acc |↑ |0.8304|± |0.0064|
 | - social sciences| 1|none | |acc |↑ |0.8736|± |0.0059|
-| - stem | 1|none | |acc |↑ |0.7456|± |0.0075|
+| - stem | 1|none | |acc |↑ |0.7456|± |0.0075|
+```