autoevaluator HF staff committed on
Commit
3419c28
1 Parent(s): 8dc17cd

Add evaluation results on the mathemakitten--winobias_antistereotype_test_cot_v1 config and test split of mathemakitten/winobias_antistereotype_test_cot_v1


Beep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the mathemakitten--winobias_antistereotype_test_cot_v1 config and test split of the [mathemakitten/winobias_antistereotype_test_cot_v1](https://huggingface.co/datasets/mathemakitten/winobias_antistereotype_test_cot_v1) dataset by @mathemakitten, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-eval-mathemakitten__winobias_antistereotype_test_cot_v1-math-6c03d1-1913164906).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=mathemakitten/winobias_antistereotype_test_cot_v1).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=mathemakitten/winobias_antistereotype_test_cot_v1).

Files changed (1)
  1. README.md +20 -1
README.md CHANGED
@@ -4,9 +4,28 @@ inference: false
 tags:
 - text-generation
 - opt
-
 license: other
 commercial: false
+model-index:
+- name: facebook/opt-6.7b
+  results:
+  - task:
+      type: zero-shot-classification
+      name: Zero-Shot Text Classification
+    dataset:
+      name: mathemakitten/winobias_antistereotype_test_cot_v1
+      type: mathemakitten/winobias_antistereotype_test_cot_v1
+      config: mathemakitten--winobias_antistereotype_test_cot_v1
+      split: test
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.3762135922330097
+      verified: true
+    - name: Loss
+      type: loss
+      value: 1.3691492240410563
+      verified: true
 ---
 
 # OPT : Open Pre-trained Transformer Language Models
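
For readers who want to consume the `model-index` metadata this diff adds to README.md, here is a minimal sketch of flattening it into a lookup table. It assumes the YAML front matter has already been parsed into plain Python lists and dicts; the `MODEL_INDEX` literal below is a hand-written mirror of the values in the diff. The names `MODEL_INDEX` and `collect_metrics` are hypothetical and not part of any Hugging Face API.

```python
# Hand-written mirror of the model-index block added in this PR.
# Assumption: the YAML front matter has already been parsed into
# plain Python lists and dicts by some YAML loader.
MODEL_INDEX = [
    {
        "name": "facebook/opt-6.7b",
        "results": [
            {
                "task": {
                    "type": "zero-shot-classification",
                    "name": "Zero-Shot Text Classification",
                },
                "dataset": {
                    "name": "mathemakitten/winobias_antistereotype_test_cot_v1",
                    "type": "mathemakitten/winobias_antistereotype_test_cot_v1",
                    "config": "mathemakitten--winobias_antistereotype_test_cot_v1",
                    "split": "test",
                },
                "metrics": [
                    {"name": "Accuracy", "type": "accuracy",
                     "value": 0.3762135922330097, "verified": True},
                    {"name": "Loss", "type": "loss",
                     "value": 1.3691492240410563, "verified": True},
                ],
            }
        ],
    }
]


def collect_metrics(model_index):
    """Flatten entries into {(model, dataset, split, metric_type): value}.

    Hypothetical helper for illustration only.
    """
    flat = {}
    for model in model_index:
        for result in model.get("results", []):
            dataset = result["dataset"]
            for metric in result.get("metrics", []):
                key = (model["name"], dataset["type"],
                       dataset["split"], metric["type"])
                flat[key] = metric["value"]
    return flat


if __name__ == "__main__":
    for key, value in collect_metrics(MODEL_INDEX).items():
        print(key, value)
```

Keying on model, dataset, split, and metric type keeps entries distinct if later evaluation PRs append more results to the same `model-index` list.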