lewtun HF staff commited on
Commit
4dc632f
1 Parent(s): 8734459

Add evaluation results on the sst2 config and validation split of glue

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the sst2 config and validation split of the [glue](https://huggingface.co/datasets/glue) dataset by

@lewtun

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-project-efa0c910-63e6-4e94-9ead-ecdfc9f84f6e-117113).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=glue).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=glue).

Files changed (1) hide show
  1. README.md +47 -0
README.md CHANGED
@@ -4,6 +4,53 @@ tags:
4
  - generated_from_trainer
5
  datasets:
6
  - glue
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  ---
8
 
9
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
4
  - generated_from_trainer
5
  datasets:
6
  - glue
7
+ model-index:
8
+ - name: autoevaluate/binary-classification-not-evaluated
9
+ results:
10
+ - task:
11
+ type: text-classification
12
+ name: Text Classification
13
+ dataset:
14
+ name: glue
15
+ type: glue
16
+ config: sst2
17
+ split: validation
18
+ metrics:
19
+ - type: accuracy
20
+ value: 0.8967889908256881
21
+ name: Accuracy
22
+ verified: true
23
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZTFhNTM5OGFkNTYxNmM5OTRmNmI0MWU1MWFiYzM5ODM0MTdiYmZmYmExOTI5ZTQzNGQ0YWRlNjQ2MjdjOWFhYSIsInZlcnNpb24iOjF9.fcoYl-t_iYhGKGJqLB-AGrmAsd_QkUXWJFsxdi-x6RjTJeCevEHSRABdLKM2UM7yJF8nGwvWjI68r1fJ1OlSCw
24
+ - type: precision
25
+ value: 0.8898678414096917
26
+ name: Precision
27
+ verified: true
28
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2FjZWMyMzUzYjI5MDdkN2M3OGZhMzU0YmRlMDQwZjc4ZWU4ZTljYWFjZDVkMzRkMTBiOGM4YmQyMjM0YTUyOCIsInZlcnNpb24iOjF9.7d28G0boU5Xc-3-ox3040mluwIbls0pjLG8XROJaqkG6ei0HVKyTds1fzgr3-JZxK6wylItVGDPg0Z5MAa5yAA
29
+ - type: recall
30
+ value: 0.9099099099099099
31
+ name: Recall
32
+ verified: true
33
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDhhZThhZjE2YzliYTAxODQ4NDNiYTM4OGQxOGQ0NzU3YzljZjViNDEyODgwNjg3NGFkZDU3MGVjNDM5ZmE2MyIsInZlcnNpb24iOjF9.eMy2JTxw821ff8umlAyX20SGSlll2e2yaVaEab3gl5xwU36qocNBve_IfluAox4J5bg8VCKhRdR-yzhJ01IZAw
34
+ - type: auc
35
+ value: 0.9672186789593331
36
+ name: AUC
37
+ verified: true
38
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiN2U5YTRmM2FjNDdmYmMzY2U0MzliMWYxOGYwMGYyNTJkMzk2YjRhMWJhMTAzMDU1NmUyNjEzYjA1NTBiNmNlMSIsInZlcnNpb24iOjF9.iWtm0L1Fvfrh5S4DEkZCx2ewFajs26DpFbX8YAOay_dkFdpgJGbr6avAyKg-tUXjUGpinW_DpeGnluXF-MtQAw
39
+ - type: f1
40
+ value: 0.8997772828507795
41
+ name: F1
42
+ verified: true
43
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYTgwMGM1NzMzZTBkMTA2YjAyYjhkZGYzZWQxZDY4ZjIzNmZkY2U1Mjk0NGZkOGVkN2QxZmMzMjdkNWIzOWYwZiIsInZlcnNpb24iOjF9.MT-ofNgyx-zxqwBjbzW5oeFG0YOAcN9OZQNpbJSvGZDWRi6ZWd5hrWohAEviNHA12LQsdu4s5oRgPpWPe25kAA
44
+ - type: loss
45
+ value: 0.30092036724090576
46
+ name: loss
47
+ verified: true
48
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiM2ZjY2FjM2M1MmM2ZmRjYmVhMGY2YTgxZjhhMTFlNjY3OTg0MzUzZjYzZWMxZTAxNTc5MjhkMDY0NzhkYTBkNSIsInZlcnNpb24iOjF9.2JQmUWcTR6_8dsFeBKt_UG0dg-qJFIIoDFxYx2O059ikdIBKHu5DqY0U2aJvuyTyWxzKxOxkSStzRSZEKOf-Bw
49
+ - type: matthews_correlation
50
+ value: 0.793630584795814
51
+ name: matthews_correlation
52
+ verified: true
53
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjZkY2IzNmMwNGY0N2NiNGQ4MGI2Yzk3YTY1M2ExZjBmYTIyMGM1YzA4NzRiMWY0YTZlOTY2YmY4NWMxYTliNSIsInZlcnNpb24iOjF9.c7TFOc93GiblJ49JbsWknmj0yPFAvO50eep4Dcof8aKbysNxDuprg67CdWN7WqIU3cEFgIcRPyC6nX5t44fHDg
54
  ---
55
 
56
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You