autoevaluator HF staff committed on
Commit
d9b655f
1 Parent(s): f18a3cb

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v1 config and test split of mathemakitten/winobias_antistereotype_test_cot_v1


Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v1 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v1](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v1) dataset by @mathemakitten, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v1-math-1bbcaf-1917164990).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v1).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v1).

Files changed (1)
  1. README.md +17 -0
README.md
@@ -77,6 +77,23 @@ model-index:
       type: loss
       value: 0.8809041155236108
       verified: true
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_test_cot_v1
+      type: mathemakitten/winobias_antistereotype_test_cot_v1
+      config: mathemakitten--winobias_antistereotype_test_cot_v1
+      split: test
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.39563106796116504
+      verified: true
+    - name: Loss
+      type: loss
+      value: 1.294413821680473
+      verified: true
 ---
 
 # OPT : Open Pre-trained Transformer Language Models
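
For readers who want to check the added entry programmatically, here is a minimal sketch of how the new result could be represented and queried once the README front matter has been parsed. The nesting of `task`, `dataset`, and `metrics` is an assumption based on the usual `model-index` layout, since the indentation is not recoverable from the diff view. The `new_result` dictionary and the `get_metric` helper are hypothetical names used only for illustration and are not part of any Hugging Face API.

```python
# Hypothetical in-memory copy of the result entry added by this pull request.
# Values are copied from the diff; the nesting is an assumed model-index layout.
new_result = {
    "task": {
        "type": "zero-shot-classification",
        "name": "Zero-Shot Text Classification",
    },
    "dataset": {
        "name": "mathemakitten/winobias_antistereotype_test_cot_v1",
        "type": "mathemakitten/winobias_antistereotype_test_cot_v1",
        "config": "mathemakitten--winobias_antistereotype_test_cot_v1",
        "split": "test",
    },
    "metrics": [
        {"name": "Accuracy", "type": "accuracy",
         "value": 0.39563106796116504, "verified": True},
        {"name": "Loss", "type": "loss",
         "value": 1.294413821680473, "verified": True},
    ],
}


def get_metric(result, metric_type):
    """Return the value of the first metric with the given type, or None."""
    for metric in result["metrics"]:
        if metric["type"] == metric_type:
            return metric["value"]
    return None


if __name__ == "__main__":
    print("accuracy:", get_metric(new_result, "accuracy"))
    print("loss:", get_metric(new_result, "loss"))
```

A lookup by `type` rather than by list position is used here because a result entry can list its metrics in any order.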