Add evaluation results on the autoevaluate--conll2003-sample config and test split of autoevaluate/conll2003-sample
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the autoevaluate--conll2003-sample config and test split of the [autoevaluate/conll2003-sample](https://huggingface.co/datasets/autoevaluate/conll2003-sample) dataset by
@lewtun
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-19f625bb-a07b-4f3a-bec2-d734d6029176-6159).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=autoevaluate/conll2003-sample).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=autoevaluate/conll2003-sample).
@@ -14,25 +14,59 @@ model-index:
|
|
14 |
- name: entity-extraction
|
15 |
results:
|
16 |
- task:
|
17 |
-
name: Token Classification
|
18 |
type: token-classification
|
|
|
19 |
dataset:
|
20 |
name: conll2003
|
21 |
type: conll2003
|
22 |
args: conll2003
|
23 |
metrics:
|
24 |
-
-
|
25 |
-
type: precision
|
26 |
value: 0.8862817854414493
|
27 |
-
|
28 |
-
|
29 |
value: 0.9084908826490659
|
30 |
-
|
31 |
-
|
32 |
value: 0.8972489227709645
|
33 |
-
|
34 |
-
|
35 |
value: 0.9774889986814304
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
36 |
---
|
37 |
|
38 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
14 |
- name: entity-extraction
|
15 |
results:
|
16 |
- task:
|
|
|
17 |
type: token-classification
|
18 |
+
name: Token Classification
|
19 |
dataset:
|
20 |
name: conll2003
|
21 |
type: conll2003
|
22 |
args: conll2003
|
23 |
metrics:
|
24 |
+
- type: precision
|
|
|
25 |
value: 0.8862817854414493
|
26 |
+
name: Precision
|
27 |
+
- type: recall
|
28 |
value: 0.9084908826490659
|
29 |
+
name: Recall
|
30 |
+
- type: f1
|
31 |
value: 0.8972489227709645
|
32 |
+
name: F1
|
33 |
+
- type: accuracy
|
34 |
value: 0.9774889986814304
|
35 |
+
name: Accuracy
|
36 |
+
- task:
|
37 |
+
type: token-classification
|
38 |
+
name: Token Classification
|
39 |
+
dataset:
|
40 |
+
name: autoevaluate/conll2003-sample
|
41 |
+
type: autoevaluate/conll2003-sample
|
42 |
+
config: autoevaluate--conll2003-sample
|
43 |
+
split: test
|
44 |
+
metrics:
|
45 |
+
- type: accuracy
|
46 |
+
value: 0.9680247550283652
|
47 |
+
name: Accuracy
|
48 |
+
verified: true
|
49 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZTgzYzIwNTcyNzgxN2JiNGU0Y2RhMmY2YzRhMzUyNGY5NGE2MDA0NTVmYTFjYzdjMWQ2M2UxOTY4YmJkNWI2OCIsInZlcnNpb24iOjF9.TXZVtZoAvkUw_iXjmVwAdPtzhimwv33pA0BqxbKLGP3QSpJAsFbAbDwh2kUaKH4mTtgmcGgmtsywIgV5_ZEFAA
|
50 |
+
- type: precision
|
51 |
+
value: 0.9708377518557795
|
52 |
+
name: Precision
|
53 |
+
verified: true
|
54 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWJkYzQ0MzhmNGE4Y2UyMmIzNThmMTdlZjMzODdlOWMzMTg1NTEwNWQ3NDMyNTYxODZiMzZhYTQ5NDU2ZGZlMSIsInZlcnNpb24iOjF9.rFvd0bxUagfktMsv-Q0NJr2WN2MuZ74dR0Opq9_MqjXnhi1wPxRcfbjw2RYUKnRM9PVVkBrb3WyTGYljcJYMCA
|
55 |
+
- type: recall
|
56 |
+
value: 0.9754928076718167
|
57 |
+
name: Recall
|
58 |
+
verified: true
|
59 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjkzMGExNzU3NWY4Y2E0ODgyZTU5MzY1NTYxMDU3M2E3N2RkMmEwNzRmNWRmZDA1N2Y3MDQ5OGE3ZWQ3ZDA0NyIsInZlcnNpb24iOjF9.yAlh4o8i2o4GG6TES8-IoYlvqCh8NS09OeQ8yILRiRo8Uk9u6CdaZAklstD60jyMlanP7c_IP-SQsqokJ41tCg
|
60 |
+
- type: f1
|
61 |
+
value: 0.9731597129949509
|
62 |
+
name: F1
|
63 |
+
verified: true
|
64 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNmFiNDdjODdjNGJhYjNiZGUwNzc2OTQ0NDhhMjk5ZTFlMjM4NTE5MTViYTBlYzI2ZTE4MzQ5MmE3MTBiZWU0ZiIsInZlcnNpb24iOjF9.amNItmETm5mBYgwTYkYEO7L7mlO6xxPJhHfy8X8LidtLir8euAUxoj4gLro9-NETDGaZOLLvvjx7SRyODMwrAg
|
65 |
+
- type: loss
|
66 |
+
value: 0.1187286302447319
|
67 |
+
name: loss
|
68 |
+
verified: true
|
69 |
+
verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWFlYThiZGFhYzI4ZjZiNDUyMmQ3ZDVhMGIzZDJhNmU3ZjEwNTU1NTE2YjA3ZjM2NGNlNTA1MmYwNWY4NTdjMiIsInZlcnNpb24iOjF9.qBgBdwqISdVvRHyJQ-8JgqeGGG6J1wrNEcoJiqUgZ8OQIn8FKi6I0xmdBukkoYMapegWqwIGjNVNF4WAsjoyAg
|
70 |
---
|
71 |
|
72 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|