Commit 0b59b09, committed by davidhornshaw
1 Parent(s): f2f5d41

Update README.md

README.md CHANGED
@@ -133,7 +133,7 @@ Evaluation results:
 |arithmetic_4da| 1 | none | 0 |acc |↑ | 0.0675|± | 0.0056|
 |arithmetic_4ds| 1 | none | 0 |acc |↑ | 0.0010|± | 0.0007|
 |**arithmetic_5da**| 1 | none | 0 |acc |↑ | **0.3720**|± | **0.0108**|
-
+|*arithmetic_5ds*| 1 | none | 0 |acc |↑ | *0.0260*|± | *0.0036*|
 |asdiv | 1 | none | 0 |acc |↑ | 0.0187|± | 0.0028|
 <figcaption>Collected USECASE benchmarks results for the base model.</figcaption>
 
@@ -168,7 +168,7 @@ Evaluation results:
 |arithmetic_4da| 1 | none | 0 |acc |↑ | 0.0710|± | 0.0057|
 |arithmetic_4ds| 1 | none | 0 |acc |↑ | 0.0005|± | 0.0005|
 |**arithmetic_5da**| 1 | none | 0 |acc |↑ | **0.4005**|± | **0.0110**|
-
+|*arithmetic_5ds*| 1 | none | 0 |acc |↑ | *0.0285*|± | *0.0037*|
 |asdiv | 1 | none | 0 |acc |↑ | 0.0204|± | 0.0029|
 <figcaption>Collected USECASE benchmarks results for the finetuned model.</figcaption>
 