Hugging Face
Models: 1
Sort: Trending
Active filters: EchoThief
pyannote/brouhaha
Voice Activity Detection • Updated Nov 15, 2022 • 451 • 11