autoevaluator HF staff commited on
Commit
ec5561d
1 Parent(s): 6129b11

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v4 config and test split of mathemakitten/winobias_antistereotype_test_cot_v4

Browse files

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v4 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v4](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v4) dataset by

@mathemakitten

, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v4-math-54ae93-2018366736).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v4).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v4).

Files changed (1) hide show
  1. README.md +17 -0
README.md CHANGED
@@ -111,6 +111,23 @@ model-index:
111
  type: loss
112
  value: 1.539870785999474
113
  verified: true
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
114
  ---
115
 
116
  # OPT : Open Pre-trained Transformer Language Models
 
111
  type: loss
112
  value: 1.539870785999474
113
  verified: true
114
+ - task:
115
+ type: zero-shot-classification
116
+ name: Zero-Shot Text Classification
117
+ dataset:
118
+ name: mathemakitten/winobias_antistereotype_test_cot_v4
119
+ type: mathemakitten/winobias_antistereotype_test_cot_v4
120
+ config: mathemakitten--winobias_antistereotype_test_cot_v4
121
+ split: test
122
+ metrics:
123
+ - name: Accuracy
124
+ type: accuracy
125
+ value: 0.3131067961165049
126
+ verified: true
127
+ - name: Loss
128
+ type: loss
129
+ value: 1.4315469591985621
130
+ verified: true
131
  ---
132
 
133
  # OPT : Open Pre-trained Transformer Language Models