Zero-Shot Classification
Transformers
PyTorch
Safetensors
English
deberta-v2
text-classification
deberta-v3-base
deberta-v3
deberta
nli
natural-language-inference
multitask
multi-task
pipeline
extreme-multi-task
extreme-mtl
tasksource
zero-shot
rlhf
Eval Results
Inference Endpoints
sileod commited on
Commit
b8d479c
1 Parent(s): fac6592

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +182 -118
README.md CHANGED
@@ -17,141 +17,205 @@ tags:
17
  - rlhf
18
  pipeline_tag: zero-shot-classification
19
  datasets:
20
- - hellaswag
21
- - ag_news
22
- - pietrolesci/nli_fever
23
- - numer_sense
24
- - go_emotions
25
- - Ericwang/promptProficiency
26
- - poem_sentiment
27
- - pietrolesci/robust_nli_is_sd
28
- - sileod/probability_words_nli
29
- - social_i_qa
30
- - trec
31
- - imppres
32
- - pietrolesci/gen_debiased_nli
33
- - snips_built_in_intents
34
- - metaeval/imppres
35
- - metaeval/crowdflower
36
- - tals/vitaminc
37
- - dream
38
- - metaeval/babi_nli
39
- - Ericwang/promptSpoke
40
- - metaeval/ethics
41
- - art
42
- - ai2_arc
43
- - discovery
44
- - Ericwang/promptGrammar
45
- - code_x_glue_cc_clone_detection_big_clone_bench
46
- - prajjwal1/discosense
47
- - pietrolesci/joci
48
- - Anthropic/model-written-evals
49
- - utilitarianism
50
- - emo
51
- - tweets_hate_speech_detection
52
- - piqa
53
- - blog_authorship_corpus
54
- - SpeedOfMagic/ontonotes_english
55
- - circa
56
- - app_reviews
57
- - anli
58
- - Ericwang/promptSentiment
59
- - codah
60
- - definite_pronoun_resolution
61
- - health_fact
62
- - tweet_eval
63
- - hate_speech18
64
  - glue
65
- - hendrycks_test
66
- - paws
67
- - bigbench
68
- - hate_speech_offensive
69
- - blimp
70
  - sick
71
- - turingbench/TuringBench
72
- - martn-nguyen/contrast_nli
73
- - Anthropic/hh-rlhf
74
- - openbookqa
75
- - species_800
76
  - alisawuffles/WANLI
77
- - ethos
 
 
 
 
 
 
 
78
  - pietrolesci/mpe
79
- - wiki_hop
80
- - pietrolesci/glue_diagnostics
81
- - mc_taco
82
- - quarel
83
- - PiC/phrase_similarity
84
- - strombergnlp/rumoureval_2019
85
- - quail
86
- - acronym_identification
87
- - pietrolesci/robust_nli
88
- - quora
89
- - wnut_17
90
- - dynabench/dynasent
91
  - pietrolesci/gpt3_nli
92
- - truthful_qa
 
 
 
 
 
 
93
  - pietrolesci/add_one_rte
94
- - pietrolesci/breaking_nli
95
- - copenlu/scientific-exaggeration-detection
 
 
 
96
  - medical_questions_pairs
97
- - rotten_tomatoes
98
- - scicite
99
- - scitail
100
- - pietrolesci/dialogue_nli
101
- - code_x_glue_cc_defect_detection
102
  - nightingal3/fig-qa
103
- - pietrolesci/conj_nli
104
- - liar
105
- - sciq
106
- - head_qa
107
- - pietrolesci/dnc
 
 
108
  - quartz
 
 
 
 
 
109
  - wiqa
110
- - code_x_glue_cc_code_refinement
111
- - Ericwang/promptCoherence
112
- - joey234/nan-nli
113
- - hope_edi
114
- - jnlpba
115
- - yelp_review_full
116
- - pietrolesci/recast_white
 
 
 
117
  - swag
118
- - banking77
119
- - cosmos_qa
120
- - financial_phrasebank
121
- - hans
122
- - pietrolesci/fracas
123
  - math_qa
124
- - conll2003
125
- - qasc
126
- - ncbi_disease
127
- - mwong/fever-evidence-related
128
- - YaHi/EffectiveFeedbackStudentWriting
129
- - ade_corpus_v2
130
- - amazon_polarity
131
- - pietrolesci/robust_nli_li_ts
132
- - super_glue
133
- - adv_glue
134
- - Ericwang/promptNLI
135
- - cos_e
136
- - launch/open_question_type
137
- - lex_glue
138
- - has_part
139
  - pragmeval
140
- - sem_eval_2010_task_8
 
 
141
  - imdb
142
- - humicroedit
143
- - sms_spam
 
 
 
144
  - dbpedia_14
145
- - commonsense_qa
146
- - hlgd
147
- - snli
148
- - hyperpartisan_news_detection
149
- - google_wellformed_query
150
- - raquiba/Sarcasm_News_Headline
151
- - metaeval/recast
152
- - winogrande
 
 
 
 
 
 
 
 
153
  - relbert/lexical_relation_classification
154
  - metaeval/linguisticprobing
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
155
  metrics:
156
  - accuracy
157
  library_name: transformers
 
17
  - rlhf
18
  pipeline_tag: zero-shot-classification
19
  datasets:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
20
  - glue
21
+ - super_glue
22
+ - anli
23
+ - metaeval/babi_nli
 
 
24
  - sick
25
+ - snli
26
+ - scitail
27
+ - hans
 
 
28
  - alisawuffles/WANLI
29
+ - metaeval/recast
30
+ - sileod/probability_words_nli
31
+ - joey234/nan-nli
32
+ - pietrolesci/nli_fever
33
+ - pietrolesci/breaking_nli
34
+ - pietrolesci/conj_nli
35
+ - pietrolesci/fracas
36
+ - pietrolesci/dialogue_nli
37
  - pietrolesci/mpe
38
+ - pietrolesci/dnc
 
 
 
 
 
 
 
 
 
 
 
39
  - pietrolesci/gpt3_nli
40
+ - pietrolesci/recast_white
41
+ - pietrolesci/joci
42
+ - martn-nguyen/contrast_nli
43
+ - pietrolesci/robust_nli
44
+ - pietrolesci/robust_nli_is_sd
45
+ - pietrolesci/robust_nli_li_ts
46
+ - pietrolesci/gen_debiased_nli
47
  - pietrolesci/add_one_rte
48
+ - metaeval/imppres
49
+ - pietrolesci/glue_diagnostics
50
+ - hlgd
51
+ - paws
52
+ - quora
53
  - medical_questions_pairs
54
+ - conll2003
55
+ - Anthropic/hh-rlhf
56
+ - Anthropic/model-written-evals
57
+ - truthful_qa
 
58
  - nightingal3/fig-qa
59
+ - tasksource/bigbench
60
+ - blimp
61
+ - cos_e
62
+ - cosmos_qa
63
+ - dream
64
+ - openbookqa
65
+ - qasc
66
  - quartz
67
+ - quail
68
+ - head_qa
69
+ - sciq
70
+ - social_i_qa
71
+ - wiki_hop
72
  - wiqa
73
+ - piqa
74
+ - hellaswag
75
+ - pkavumba/balanced-copa
76
+ - 12ml/e-CARE
77
+ - art
78
+ - tasksource/mmlu
79
+ - winogrande
80
+ - codah
81
+ - ai2_arc
82
+ - definite_pronoun_resolution
83
  - swag
 
 
 
 
 
84
  - math_qa
85
+ - metaeval/utilitarianism
86
+ - mteb/amazon_counterfactual
87
+ - SetFit/insincere-questions
88
+ - SetFit/toxic_conversations
89
+ - turingbench/TuringBench
90
+ - trec
91
+ - tals/vitaminc
92
+ - hope_edi
93
+ - strombergnlp/rumoureval_2019
94
+ - ethos
95
+ - tweet_eval
96
+ - discovery
 
 
 
97
  - pragmeval
98
+ - silicone
99
+ - lex_glue
100
+ - papluca/language-identification
101
  - imdb
102
+ - rotten_tomatoes
103
+ - ag_news
104
+ - yelp_review_full
105
+ - financial_phrasebank
106
+ - poem_sentiment
107
  - dbpedia_14
108
+ - amazon_polarity
109
+ - app_reviews
110
+ - hate_speech18
111
+ - sms_spam
112
+ - humicroedit
113
+ - snips_built_in_intents
114
+ - banking77
115
+ - hate_speech_offensive
116
+ - yahoo_answers_topics
117
+ - pacovaldez/stackoverflow-questions
118
+ - zapsdcn/hyperpartisan_news
119
+ - zapsdcn/sciie
120
+ - zapsdcn/citation_intent
121
+ - go_emotions
122
+ - scicite
123
+ - liar
124
  - relbert/lexical_relation_classification
125
  - metaeval/linguisticprobing
126
+ - metaeval/crowdflower
127
+ - metaeval/ethics
128
+ - emo
129
+ - google_wellformed_query
130
+ - tweets_hate_speech_detection
131
+ - has_part
132
+ - wnut_17
133
+ - ncbi_disease
134
+ - acronym_identification
135
+ - jnlpba
136
+ - species_800
137
+ - SpeedOfMagic/ontonotes_english
138
+ - blog_authorship_corpus
139
+ - launch/open_question_type
140
+ - health_fact
141
+ - commonsense_qa
142
+ - mc_taco
143
+ - ade_corpus_v2
144
+ - prajjwal1/discosense
145
+ - circa
146
+ - YaHi/EffectiveFeedbackStudentWriting
147
+ - Ericwang/promptSentiment
148
+ - Ericwang/promptNLI
149
+ - Ericwang/promptSpoke
150
+ - Ericwang/promptProficiency
151
+ - Ericwang/promptGrammar
152
+ - Ericwang/promptCoherence
153
+ - PiC/phrase_similarity
154
+ - copenlu/scientific-exaggeration-detection
155
+ - quarel
156
+ - mwong/fever-evidence-related
157
+ - numer_sense
158
+ - dynabench/dynasent
159
+ - raquiba/Sarcasm_News_Headline
160
+ - sem_eval_2010_task_8
161
+ - demo-org/auditor_review
162
+ - medmcqa
163
+ - aqua_rat
164
+ - RuyuanWan/Dynasent_Disagreement
165
+ - RuyuanWan/Politeness_Disagreement
166
+ - RuyuanWan/SBIC_Disagreement
167
+ - RuyuanWan/SChem_Disagreement
168
+ - RuyuanWan/Dilemmas_Disagreement
169
+ - lucasmccabe/logiqa
170
+ - wiki_qa
171
+ - metaeval/cycic_classification
172
+ - metaeval/cycic_multiplechoice
173
+ - metaeval/sts-companion
174
+ - metaeval/commonsense_qa_2.0
175
+ - metaeval/lingnli
176
+ - metaeval/monotonicity-entailment
177
+ - metaeval/arct
178
+ - metaeval/scinli
179
+ - metaeval/naturallogic
180
+ - onestop_qa
181
+ - demelin/moral_stories
182
+ - corypaik/prost
183
+ - aps/dynahate
184
+ - metaeval/syntactic-augmentation-nli
185
+ - metaeval/autotnli
186
+ - lasha-nlp/CONDAQA
187
+ - openai/webgpt_comparisons
188
+ - Dahoas/synthetic-instruct-gptj-pairwise
189
+ - metaeval/scruples
190
+ - metaeval/wouldyourather
191
+ - sileod/attempto-nli
192
+ - metaeval/defeasible-nli
193
+ - metaeval/help-nli
194
+ - metaeval/nli-veridicality-transitivity
195
+ - metaeval/natural-language-satisfiability
196
+ - metaeval/lonli
197
+ - metaeval/dadc-limit-nli
198
+ - ColumbiaNLP/FLUTE
199
+ - metaeval/strategy-qa
200
+ - openai/summarize_from_feedback
201
+ - metaeval/folio
202
+ - metaeval/tomi-nli
203
+ - metaeval/avicenna
204
+ - stanfordnlp/SHP
205
+ - GBaker/MedQA-USMLE-4-options-hf
206
+ - sileod/wikimedqa
207
+ - declare-lab/cicero
208
+ - amydeng2000/CREAK
209
+ - metaeval/mutual
210
+ - inverse-scaling/NeQA
211
+ - inverse-scaling/quote-repetition
212
+ - inverse-scaling/redefine-math
213
+ - metaeval/puzzte
214
+ - metaeval/implicatures
215
+ - race
216
+ - metaeval/spartqa-yn
217
+ - metaeval/spartqa-mchoice
218
+ - metaeval/temporal-nli
219
  metrics:
220
  - accuracy
221
  library_name: transformers