autoevaluator HF staff commited on
Commit
eb9c24c
1 Parent(s): 7735f02

Add verifyToken field to verify evaluation results are produced by Hugging Face's automatic model evaluator

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋! We've added a new `verifyToken` field to your evaluation results to verify that they are produced by the model evaluator. Accept this PR to ensure that your results remain listed as **verified** on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards).

Files changed (1) hide show
  1. README.md +23 -17
README.md CHANGED
@@ -13,19 +13,19 @@ model-index:
13
  - name: glue-mrpc
14
  results:
15
  - task:
16
- name: Text Classification
17
  type: text-classification
 
18
  dataset:
19
  name: GLUE MRPC
20
  type: glue
21
  args: mrpc
22
  metrics:
23
- - name: Accuracy
24
- type: accuracy
25
  value: 0.8553921568627451
26
- - name: F1
27
- type: f1
28
  value: 0.897391304347826
 
29
  - task:
30
  type: natural-language-inference
31
  name: Natural Language Inference
@@ -35,30 +35,36 @@ model-index:
35
  config: mrpc
36
  split: validation
37
  metrics:
38
- - name: Accuracy
39
- type: accuracy
40
  value: 0.8553921568627451
 
41
  verified: true
42
- - name: Precision
43
- type: precision
44
  value: 0.8716216216216216
 
45
  verified: true
46
- - name: Recall
47
- type: recall
48
  value: 0.9247311827956989
 
49
  verified: true
50
- - name: AUC
51
- type: auc
52
  value: 0.90464282737351
 
53
  verified: true
54
- - name: F1
55
- type: f1
56
  value: 0.897391304347826
 
57
  verified: true
58
- - name: loss
59
- type: loss
60
  value: 0.6564616560935974
 
61
  verified: true
 
62
  ---
63
 
64
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
13
  - name: glue-mrpc
14
  results:
15
  - task:
 
16
  type: text-classification
17
+ name: Text Classification
18
  dataset:
19
  name: GLUE MRPC
20
  type: glue
21
  args: mrpc
22
  metrics:
23
+ - type: accuracy
 
24
  value: 0.8553921568627451
25
+ name: Accuracy
26
+ - type: f1
27
  value: 0.897391304347826
28
+ name: F1
29
  - task:
30
  type: natural-language-inference
31
  name: Natural Language Inference
 
35
  config: mrpc
36
  split: validation
37
  metrics:
38
+ - type: accuracy
 
39
  value: 0.8553921568627451
40
+ name: Accuracy
41
  verified: true
42
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNmU4NDJlOWE2NmYyNjMzYWQwZWU4MTE4Mjg1ZTFmMmUzOTYxZTU1OWRmYTRiZjY4YWI0OGQ2NTUzYzA2MDRhZSIsInZlcnNpb24iOjF9.iNArgwgVl4QZlyMf4VNiZXJxG6gjG60S2k81LPI8lMwU7dr-SUDabagR0kRGMRoOck45h4G8x7sHtQU2AnBUAQ
43
+ - type: precision
44
  value: 0.8716216216216216
45
+ name: Precision
46
  verified: true
47
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWNiZjBhMjQyNGYxNjcwODFjYzBlYjMyODVmODlhNWM1OTcxYzE3NTYxOGQ4NWM1NGE3YjRhMGQ4OGNhMWRmYyIsInZlcnNpb24iOjF9.q3k_FnrmYo3pP_8l2IudhN1zctJPlUm7hrzAmc-32nt_sIJ7hRpJogm30pSrhDDDqwC-gKBz2pApetsPSw-tBQ
48
+ - type: recall
49
  value: 0.9247311827956989
50
+ name: Recall
51
  verified: true
52
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWY0MTljNWY4MjE2MTU1MTg1ZDU0Nzk4MzhiMGE3OTgzNTQyZDk1YWYyMjdlYTI2NDI5Yjc1YjllMGEzYzllNSIsInZlcnNpb24iOjF9.YFaiSeGk-4BfSiAEJIj45smxjib8jBsm99IVXW2FHIDGCaJu9__afJszeWnLgnd1MaUvKlk8DushrbNaI_xrCA
53
+ - type: auc
54
  value: 0.90464282737351
55
+ name: AUC
56
  verified: true
57
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOGJkOWNmNzUyMzUzMGNiZGIyYTdlZGVlYjFlMjg5ZTc0NGY5MWRjYjVhZDMyZGUxZjk2ZjE1NTIwN2ZjOGVlNyIsInZlcnNpb24iOjF9.YLFNmEt5Iyx2lTF0M5B9GOuNfJy4b30Cx_ccOe3EIzRHbnmvNUAYtA33AqEWFDDGBCkM3O1BBvQrB-79CYx2DQ
58
+ - type: f1
59
  value: 0.897391304347826
60
+ name: F1
61
  verified: true
62
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZjJiNjUxM2YxOTVhNjg1OThlMGI2NDQ2MmZjN2YyZDM4NDI4MDUyZjFjNTQwNDAzMjEyN2M2MzQxMTZhZTE4ZiIsInZlcnNpb24iOjF9.U8RflOAuvMqrWPyy3C5tU7eWRsVNi8o1IBA_l1rgH0UbbBHDXoAvJZiZaHXXV_pfI-Mrz34E4XFhHoFAwJlbDg
63
+ - type: loss
64
  value: 0.6564616560935974
65
+ name: loss
66
  verified: true
67
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOGNlZTFhZDQ0MzE3YzVkMTM1YzZmYzMzNWRhZjM1OGUwYWY5YjA2YTA2YTMyMTU4MjE4M2U1ODRjMWJlMDdmOSIsInZlcnNpb24iOjF9.zEMs0_m4UGoJpUQHkcCXUhP5QLTn6on78JJIFZEMDpL8YMuWQk75-urKgfxxb1STM0Vt6SL8JFR4bz-i7MbgCQ
68
  ---
69
 
70
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You