Finalised model card
Browse files
README.md
CHANGED
@@ -174,5 +174,9 @@ We highlight the relevant rows for five-digit addition and subtraction for easy
|
|
174 |
|
175 |
</figure>
|
176 |
|
177 |
-
**VISUALISATION OF
|
178 |
|
|
|
|
|
|
|
|
|
|
174 |
|
175 |
</figure>
|
176 |
|
177 |
+
**VISUALISATION OF USECASE BENCHMARK RESULTS**
|
178 |
|
179 |
+
USECASE benchmark results given as percentage change of finetuned model relative to base model.
|
180 |
+
For visualisation purposes we drop "arithmetic_4ds" since results for both models are very small and dominated by standard error.
|
181 |
+
|
182 |
+
![USECASE benchmark results](./eval_results.png)
|