autoevaluator HF staff committed on
Commit
c711475
1 Parent(s): 238fce8

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v3 config and test split of mathemakitten/winobias_antistereotype_test_cot_v3

Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v3 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v3](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v3) dataset by @mathemakitten, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v3-math-468e93-2011366586).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v3).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v3).
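
For readers who want to check the added scores programmatically, here is a minimal sketch. It assumes the README front matter has already been parsed into plain Python data (for example with a YAML parser); the `model_index` literal simply mirrors the entry this pull request adds. The model name `example-model` and the helper `verified_metrics` are hypothetical names used for illustration, not part of any Hugging Face API.

```python
from typing import Any, Dict, List

DATASET = "mathemakitten/winobias_antistereotype_test_cot_v3"

# Mirrors the `model-index` entry added by this pull request, as it would
# look after parsing. The model name is a placeholder (assumption).
model_index: List[Dict[str, Any]] = [
    {
        "name": "example-model",  # hypothetical; the real name is not shown here
        "results": [
            {
                "task": {
                    "type": "zero-shot-classification",
                    "name": "Zero-Shot Text Classification",
                },
                "dataset": {
                    "name": DATASET,
                    "type": DATASET,
                    "config": "mathemakitten--winobias_antistereotype_test_cot_v3",
                    "split": "test",
                },
                "metrics": [
                    {"name": "Accuracy", "type": "accuracy",
                     "value": 0.3737864077669903, "verified": True},
                    {"name": "Loss", "type": "loss",
                     "value": 1.2823651640752816, "verified": True},
                ],
            }
        ],
    }
]


def verified_metrics(index: List[Dict[str, Any]], dataset_type: str) -> Dict[str, float]:
    """Collect {metric type: value} for verified metrics reported on one dataset."""
    found: Dict[str, float] = {}
    for model in index:
        for result in model.get("results", []):
            if result.get("dataset", {}).get("type") != dataset_type:
                continue  # skip results reported on other datasets
            for metric in result.get("metrics", []):
                if metric.get("verified"):
                    found[metric["type"]] = metric["value"]
    return found


scores = verified_metrics(model_index, DATASET)
print(scores)
```

Filtering on `verified` keeps only the values written by the evaluator, which is useful when a model card also carries self-reported numbers.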

Files changed (1)
  1. README.md +17 -0
README.md CHANGED
@@ -77,6 +77,23 @@ model-index:
       type: loss
       value: 0.7550815605928027
       verified: true
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_test_cot_v3
+      type: mathemakitten/winobias_antistereotype_test_cot_v3
+      config: mathemakitten--winobias_antistereotype_test_cot_v3
+      split: test
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.3737864077669903
+      verified: true
+    - name: Loss
+      type: loss
+      value: 1.2823651640752816
+      verified: true
 ---
 
 # OPT : Open Pre-trained Transformer Language Models