autoevaluator HF staff commited on
Commit
d5fb657
1 Parent(s): e49ab07

Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋! We've added a new `verifyToken` field to your evaluation results to verify that they are produced by the model evaluator. Accept this PR to ensure that your results remain listed as **verified** on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards).

Files changed (1) hide show
  1. README.md +25 -19
README.md CHANGED
@@ -2,6 +2,13 @@
2
  license: mit
3
  tags:
4
  - generated_from_trainer
 
 
 
 
 
 
 
5
  model-index:
6
  - name: rob-base-superqa
7
  results:
@@ -14,14 +21,16 @@ model-index:
14
  config: adversarialQA
15
  split: validation
16
  metrics:
17
- - name: Exact Match
18
- type: exact_match
19
  value: 43.8667
 
20
  verified: true
21
- - name: F1
22
- type: f1
23
  value: 55.135
 
24
  verified: true
 
25
  - task:
26
  type: question-answering
27
  name: Question Answering
@@ -31,14 +40,16 @@ model-index:
31
  config: squad_v2
32
  split: validation
33
  metrics:
34
- - name: Exact Match
35
- type: exact_match
36
  value: 79.2432
 
37
  verified: true
38
- - name: F1
39
- type: f1
40
  value: 82.336
 
41
  verified: true
 
42
  - task:
43
  type: question-answering
44
  name: Question Answering
@@ -48,21 +59,16 @@ model-index:
48
  config: default
49
  split: validation
50
  metrics:
51
- - name: Exact Match
52
- type: exact_match
53
  value: 78.8581
 
54
  verified: true
55
- - name: F1
56
- type: f1
57
  value: 82.8261
 
58
  verified: true
59
- task:
60
- - question-answering
61
- datasets:
62
- - squad_v2
63
- - quoref
64
- - adversarial_qa
65
- - duorc
66
  ---
67
 
68
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
2
  license: mit
3
  tags:
4
  - generated_from_trainer
5
+ datasets:
6
+ - squad_v2
7
+ - quoref
8
+ - adversarial_qa
9
+ - duorc
10
+ task:
11
+ - question-answering
12
  model-index:
13
  - name: rob-base-superqa
14
  results:
 
21
  config: adversarialQA
22
  split: validation
23
  metrics:
24
+ - type: exact_match
 
25
  value: 43.8667
26
+ name: Exact Match
27
  verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzIxMWZiZWM1MTJmMGIxM2I5NTFjNGI5OTJiNDdjODQ3NDNkYjRkYTI3ZmZkNGVmMGYzZDk5MTZhNDE4YzI1YiIsInZlcnNpb24iOjF9.QAj_iwD0yN2woSbGAN9xVRKoDKxldZbleFeJr77P2s7xWQBsKCuY0b5-2WIL79EcTCChvjNITeriPXqz8mGMAw
29
+ - type: f1
30
  value: 55.135
31
+ name: F1
32
  verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjJkMzZjNTVhZTI5OTVhNTU4NDcyMjM1ZWJiODVjNzBhODRmZjlmMjE0MDUzMmU4NzNlNzA5NjgyODdkNTJmZSIsInZlcnNpb24iOjF9.O0KoLquXYbF3P2PGCFW8bxYEVe_yDW-WzEqpOmbIs_e9v4tcygH19ZUYFjMDFSll91SPJ2oIbVovsUISYuknCg
34
  - task:
35
  type: question-answering
36
  name: Question Answering
 
40
  config: squad_v2
41
  split: validation
42
  metrics:
43
+ - type: exact_match
 
44
  value: 79.2432
45
+ name: Exact Match
46
  verified: true
47
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjBjZjhjMTMzMzZhOTg1OGYyZDY2MzZjYmQ4NmNlNWI5MWNmNTBiZjY1Njg0YTYyMmRlNzlkZDU1NTZjOWM5ZCIsInZlcnNpb24iOjF9.1vo9JoASJ_zvOVa4lTRMNPljUvMon-E6QOZ1n_KFQBMtRvRY883ECudhAzb5LGpLntyM2EN5bfyfTQ6dfjjsDg
48
+ - type: f1
49
  value: 82.336
50
+ name: F1
51
  verified: true
52
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOWFiZGMyMzkwOTlkMWVkZmExNjdkZTM1YjRkYzRkZDlhOGZlMjEwMGNjNjJhYjM5MjZlNDI3ZDEyNmViOGYyOSIsInZlcnNpb24iOjF9.f3xlhop8hXWCCWFXWZgyK9r8Cy5KE3gPgYNV3bRN78teN_hjYH5sDl4wMTMcPU-bsPX70_wvsuvU-r95ByF4Bg
53
  - task:
54
  type: question-answering
55
  name: Question Answering
 
59
  config: default
60
  split: validation
61
  metrics:
62
+ - type: exact_match
 
63
  value: 78.8581
64
+ name: Exact Match
65
  verified: true
66
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjMyYTAxOWJhYTM5YWNmNGFhZDg3NTIwN2UxN2RhYzQxYzFiODJjYTcyZTk5MGMwODNhMzA3Nzc3MDQzYjcwMiIsInZlcnNpb24iOjF9.FSNswUf1Y5ZnlS0fSm-lxsA1klUphzfDhfj00U5benVd0QiYvyeqRclC7Pw8B3RV9Oe1cZzfeDDA5fXY2A5JBw
67
+ - type: f1
68
  value: 82.8261
69
+ name: F1
70
  verified: true
71
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNWQyMTNhZTc0MTdiMzNiNzc3YzhkNTk5ZWRkMWZlYjc4ZGU3YTFkNDkyZDg0NWFiYzFhMGQyMzZjYjcwNTE1YSIsInZlcnNpb24iOjF9.9waqQm_EBPo41pdOMmoY6r_-K7-3zUxt1AB4ndHTY50S5k5yyub8NdCJz09hBhbRd1_-1t3UT5p8HnFjAjF9DQ
 
 
 
 
 
 
72
  ---
73
 
74
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You