autoevaluator HF staff commited on
Commit
e05e849
1 Parent(s): ed091de

Add evaluation results on the mathemakitten--winobias_antistereotype_dev config and validation split of mathemakitten/winobias_antistereotype_dev

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_dev config and validation split of the [mathemakitten/winobias_antistereotype_dev](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_dev) dataset by

@gmcather

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_dev-mathemakitte-c87316-2844283322).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_dev).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_dev).

Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -4,7 +4,26 @@ tags:
4
  - generated_from_trainer
5
  model-index:
6
  - name: opt-125m-wikitext2
7
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
4
  - generated_from_trainer
5
  model-index:
6
  - name: opt-125m-wikitext2
7
+ results:
8
+ - task:
9
+ type: zero-shot-classification
10
+ name: Zero-Shot Text Classification
11
+ dataset:
12
+ name: mathemakitten/winobias_antistereotype_dev
13
+ type: mathemakitten/winobias_antistereotype_dev
14
+ config: mathemakitten--winobias_antistereotype_dev
15
+ split: validation
16
+ metrics:
17
+ - type: accuracy
18
+ value: 0.4375
19
+ name: Accuracy
20
+ verified: true
21
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiOTYwMThiZjYwMmNmZDA4ZmUyNzYwNjU3MGU0NTRhZjI5NWVjNGRmOTRhZWRlZWVmYzQ5MWRiNmQ0MzRlMjU0NyIsInZlcnNpb24iOjF9.hqz3Xwego8qtofxVwg0yO0Ovk6G_kXYj55gPPLzmBFYJQOuy9kSa8g2Sm_GRTU0GSU10rAtDzYCSyehqz58VBQ
22
+ - type: loss
23
+ value: 1.1480248920549638
24
+ name: Loss
25
+ verified: true
26
+ verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDhkODZmMzgyODQyNzYzZGE4Nzc2YjhkZTk0ZDgwMGUwMjI0ZjliMmQyODM3NzlkODkwOWEwYzBlNWU4NzlmMCIsInZlcnNpb24iOjF9.qlhKo7zXMRYAYWTszoZ7VQqF52mCDBQ7PRLD6y6doFmAnmKa8nv4VQeDBFdYwomMA1Rdw2jwqhYcwbgSYEm0CA
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You