autoevaluator HF staff commited on
Commit
d05c674
1 Parent(s): f454423

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v3 config and test split of mathemakitten/winobias_antistereotype_test_cot_v3

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v3 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v3](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v3) dataset by

@mathemakitten

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v3-math-468e93-2011366587).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v3).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v3).

Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -94,6 +94,23 @@ model-index:
94
  type: loss
95
  value: 1.3472958901782706
96
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
97
  ---
98
 
99
  # OPT : Open Pre-trained Transformer Language Models
 
94
  type: loss
95
  value: 1.3472958901782706
96
  verified: true
97
+ - task:
98
+ type: zero-shot-classification
99
+ name: Zero-Shot Text Classification
100
+ dataset:
101
+ name: mathemakitten/winobias_antistereotype_test_cot_v3
102
+ type: mathemakitten/winobias_antistereotype_test_cot_v3
103
+ config: mathemakitten--winobias_antistereotype_test_cot_v3
104
+ split: test
105
+ metrics:
106
+ - name: Accuracy
107
+ type: accuracy
108
+ value: 0.38106796116504854
109
+ verified: true
110
+ - name: Loss
111
+ type: loss
112
+ value: 1.2352868161494008
113
+ verified: true
114
  ---
115
 
116
  # OPT : Open Pre-trained Transformer Language Models