readme: fix table
Browse files
README.md
CHANGED
@@ -59,9 +59,9 @@ Evaluation is performed with SpanMarkers internal evaluation code that uses `seq
|
|
59 |
We fine-tune 5 models and upload the model with best F1-Score on development set. Results on development set are
|
60 |
in brackets:
|
61 |
|
62 |
-
| Model
|
63 |
-
|
|
64 |
-
| GWLMS Token Dropping BERT
|
65 |
|
66 |
The best model achieves a final test score of 87.44%.
|
67 |
|
|
|
59 |
We fine-tune 5 models and upload the model with best F1-Score on development set. Results on development set are
|
60 |
in brackets:
|
61 |
|
62 |
+
| Model | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Avg.
|
63 |
+
| ------------------------- | --------------- | ------------------- | --------------- | --------------- | --------------- | ---------------
|
64 |
+
| GWLMS Token Dropping BERT | (87.85) / 87.28 | (**88.09**) / 87.44 | (87.59) / 87.26 | (87.71) / 87.43 | (87.83) / 87.24 | (87.81) / 87.33
|
65 |
|
66 |
The best model achieves a final test score of 87.44%.
|
67 |
|