Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
1
Languages
Licenses
Other
Reset Datasets
No match found for active filter
Datasets with no match
mozilla-foundation/common_voice_7_0
imagenet-1k
imdb
xtreme
wikipedia
mozilla-foundation/common_voice_11_0
common_voice
conll2003
tweet_eval
Open-Orca/OpenOrca
marsyas/gtzan
samsum
bookcorpus
fka/awesome-chatgpt-prompts
clinc_oos
OpenAssistant/oasst1
LDJnr/Capybara
c4
kde4
cnn_dailymail
Intel/orca_dpo_pairs
jondurbin/airoboros-2.2.1
facebook/voxpopuli
garage-bAInd/Open-Platypus
mozilla-foundation/common_voice_13_0
super_glue
Open-Orca/SlimOrca
bigcode/starcoderdata
ag_news
HuggingFaceH4/ultrachat_200k
PolyAI/minds14
google/fleurs
billsum
cerebras/SlimPajama-627B
databricks/databricks-dolly-15k
teknium/openhermes
HuggingFaceH4/ultrafeedback_binarized
beans
librispeech_asr
oscar
huggan/smithsonian_butterflies_subset
universal_dependencies
teknium/OpenHermes-2.5
wmt16
tiiuae/falcon-refinedweb
Anthropic/hh-rlhf
mc4
mozilla-foundation/common_voice_8_0
togethercomputer/RedPajama-Data-1T
TIGER-Lab/MathInstruct
migtissera/Synthia-v1.3
allenai/ultrafeedback_binarized_cleaned
tatsu-lab/alpaca
wnut_17
cc100
amazon_reviews_multi
WizardLM/WizardLM_evol_instruct_V2_196k
cais/mmlu
meta-math/MetaMathQA
ise-uiuc/Magicoder-OSS-Instruct-75K
food101
ise-uiuc/Magicoder-Evol-Instruct-110K
lmsys/lmsys-chat-1m
piqa
jondurbin/truthy-dpo-v0.1
sst2
unalignment/toxic-dpo-v0.1
HuggingFaceH4/no_robots
snli
gsm8k
jondurbin/airoboros-3.1
klue
Muennighoff/natural-instructions
Vezora/Tested-22k-Python-Alpaca
opus_books
scene_parse_150
spider
relbert/semeval2012_relational_similarity
jondurbin/cinematika-v0.1
ms_marco
codeparrot/apps
facebook/belebele
cakiki/rosetta-code
superb
eli5
lemonilia/LimaRP
jondurbin/airoboros-3.2
yahma/alpaca-cleaned
LDJnr/Verified-Camel
OpenAssistant/oasst_top1_2023-08-25
wikitext
xnli
kingbri/PIPPA-shareGPT
cifar10
stanfordnlp/SHP
unalignment/spicy-3.1
wikiann
EleutherAI/pile
cppe-5
financial_phrasebank
nvidia/HelpSteer
natural_questions
PygmalionAI/PIPPA
LDJnr/Pure-Dove
code_search_net
argilla/distilabel-intel-orca-dpo-pairs
uonlp/CulturaX
GAIR/lima
Squish42/bluemoon-fandom-1-1-rp-cleaned
mozilla-foundation/common_voice_16_0
esb/datasets
anon8231489123/ShareGPT_Vicuna_unfiltered
yelp_review_full
bigcode/the-stack-dedup
pubmed
Lajonbot/alpaca-dolly-chrisociepa-instruction-only-polish
yahoo_answers_topics
jondurbin/airoboros-2.2
mnist
stsb_multi_mt
jondurbin/airoboros-gpt4-1.4.1
pg19
gokuls/wiki_book_corpus_complete_processed_bert_dataset
mozilla-foundation/common_voice_16_1
flax-sentence-embeddings/stackexchange_xml
LDJnr/LessWrong-Amplify-Instruct
camel-ai/physics
wikimedia/wikipedia
dell-research-harvard/AmericanStories
embedding-data/sentence-compression
unicamp-dl/mmarco
swag
varun-v-rao/squad
wikihow
camel-ai/math
camel-ai/biology
camel-ai/chemistry
allenai/nllb
assin2
lmqg/qg_squad
search_qa
vicgalle/alpaca-gpt4
Open-Orca/SlimOrca-Dedup
openai/summarize_from_feedback
HuggingFaceM4/WebSight
kejian/codeparrot-train-more-filter-3.3b-cleaned
gooaq
lmqg/qg_subjqa
embedding-data/WikiAnswers
go_emotions
b-mc2/sql-create-context
argilla/distilabel-capybara-dpo-7k-binarized
jondurbin/gutenberg-dpo-v0.1
embedding-data/PAQ_pairs
wikisql
mozilla-foundation/common_voice_9_0
timdettmers/openassistant-guanaco
embedding-data/altlex
embedding-data/simple-wiki
banking77
stingning/ultrachat
HuggingFaceTB/cosmopedia
NbAiLab/NPSC
QingyiSi/Alpaca-CoT
tner/tweetner7
mbpp
masakhaner
liuhaotian/LLaVA-Instruct-150K
mlabonne/chatml_dpo_pairs
assin
openchat/openchat_sharegpt4_dataset
multi_news
detection-datasets/coco
embedding-data/SPECTER
allenai/tulu-v2-sft-mixture
sahil2801/CodeAlpaca-20k
ncbi_disease
bigcode/the-stack
rotten_tomatoes
openslr
argilla/distilabel-math-preference-dpo
wiki_lingua
scientific_papers
glaiveai/glaive-function-calling-v2
LeoLM/OpenSchnabeltier
totally-not-an-llm/EverythingLM-data-V3
LeoLM/German_Poems
LeoLM/German_Songs
argilla/dpo-mix-7k
WizardLM/WizardLM_evol_instruct_70k
JosephusCheung/GuanacoDataset
vctk
oscar-corpus/OSCAR-2301
Gustavosta/Stable-Diffusion-Prompts
fblgit/tree-of-knowledge
argilla/ultrafeedback-binarized-preferences-cleaned
LDJnr/Puffin
Norquinal/claude_multiround_chat_30k
jondurbin/airoboros-gpt4-m2.0
Nerfgun3/bad_prompt
THUDM/AgentInstruct
unalignment/toxic-dpo-v0.2
facebook/multilingual_librispeech
lmqg/qg_squadshifts
allenai/c4
m-a-p/Code-Feedback
yhavinga/mc4_nl_cleaned
beomi/KoAlpaca-v1.1a
glaiveai/glaive-code-assistant
NbAiLab/NST
abacusai/SystemChat
berkeley-nest/Nectar
bigbench
multilingual_librispeech
allenai/dolma
gsdf/EasyNegative
race
mlsum
kunishou/databricks-dolly-15k-ja
lambdalabs/pokemon-blip-captions
Doctor-Shotgun/no-robots-sharegpt
esnli
metaeval/reclor
Hello-SimpleAI/HC3
kmfoda/booksum
teknium/GPTeacher-General-Instruct
liuhaotian/LLaVA-Pretrain
IlyaGusev/ru_turbo_alpaca
Doctor-Shotgun/capybara-sharegpt
mrqa
winogrande
grimulkan/LimaRP-augmented
derek-thomas/ScienceQA
TIGER-Lab/ScienceEval
m-a-p/CodeFeedback-Filtered-Instruction
lmqg/qg_jaquad
covid_qa_deepset
riddle_sense
kaist-ai/Feedback-Collection
roneneldan/TinyStories
JeanKaddour/minipile
jondurbin/airoboros-2.1
covost2
jytjyt05/t_to_m7
openbmb/UltraFeedback
mlabonne/guanaco-llama2-1k
allenai/MADLAD-400
mattpscott/airoboros-summarization
lmqg/qg_esquad
lmqg/qg_ruquad
fever
lambada
FreedomIntelligence/alpaca-gpt4-deutsch
FreedomIntelligence/evol-instruct-deutsch
allenai/objaverse
chargoddard/rpguild
WhiteRabbitNeo/WRN-Chapter-1
WhiteRabbitNeo/WRN-Chapter-2
tydiqa
bigcode/guanaco-commits
EleutherAI/the_pile_deduplicated
openai/webgpt_comparisons
OpenAssistant/OASST-DE
CohereForAI/aya_dataset
nlpai-lab/kullm-v2
grimulkan/theory-of-mind
jondurbin/contextual-dpo-v0.1
timit_asr
flozi00/conversations
daily_dialog
lmqg/qg_itquad
deepmind/code_contests
cognitivecomputations/dolphin
csebuetnlp/xlsum
cifar100
argilla/ultrafeedback-binarized-preferences
jondurbin/py-dpo-v0.1
ParisNeo/lollms_aware_dataset
NeuralNovel/Neural-Story-v1
lmqg/qg_koquad
iamplus/Instruction_Tuning
aqua_rat
OpenAssistant/oasst2
laion/OIG
pszemraj/simple_wikipedia_LM
mozilla-foundation/common_voice_12_0
cats_vs_dogs
AmazonScience/massive
paws-x
cognitivecomputations/dolphin-coder
RyokoAI/ShareGPT52K
lmqg/qg_dequad
ami
mozilla-foundation/common_voice_10_0
amazon_polarity
tiedong/goat
kyujinpy/OpenOrca-KO
STEM-AI-mtl/Electrical-engineering
dair-ai/emotion
IlyaGusev/ru_turbo_saiga
mlqa
iamtarun/python_code_instructions_18k_alpaca
squad_es
german-nlp-group/german_common_crawl
svakulenk0/qrecc
taskmaster2
djaym7/wiki_dialog
qed
cuad
sablo/oasst2_curated
stjiris/portuguese-legal-sentences-v0
kyujinpy/KOR-OpenOrca-Platypus-v3
maywell/ko_wikidata_QA
BAAI/COIG
nvidia/OpenMathInstruct-1
IlyaGusev/ru_sharegpt_cleaned
lksy/ru_instruct_gpt4
Finnish-NLP/mc4_fi_cleaned
subjqa
wmt19
emozilla/yarn-train-tokenized-16k-mistral
laion/laion2B-en
mosaicml/dolly_hhrlhf
bigscience/P3
wmt14
athirdpath/DPO_Pairs-Roleplay-Alpaca-NSFW
allenai/ai2_arc
microsoft/orca-math-word-problems-200k
Yaxin/SemEval2014Task4Raw
competitions/aiornot
nsmc
seedboxai/multitask_german_examples_32k
shahules786/orca-chat
conceptual_captions
argilla/OpenHermes2.5-dpo-binarized-alpha
kyujinpy/KOpen-platypus
c-s-ale/alpaca-gpt4-data
bigscience/xP3
EleutherAI/proof-pile-2
imone/OpenOrca_FLAN
nomic-ai/gpt4all_prompt_generations
togethercomputer/RedPajama-Data-1T-Sample
Azure99/blossom-chat-v1
teknium/GPT4-LLM-Cleaned
Norquinal/claude_multiround_chat_1k
bertin-project/alpaca-spanish
jondurbin/airoboros-3.0
segments/sidewalk-semantic
HuggingFaceH4/cai-conversation-harmless
IlyaGusev/oasst1_ru_main_branch
fnlp/moss-003-sft-data
squad_it
cardiffnlp/super_tweeteval
mozilla-foundation/common_voice_15_0
Salesforce/dialogstudio
reginaboateng/cleaned_ebmnlp_pico
eugenesiow/Div2k
wikitablequestions
kunishou/hh-rlhf-49k-ja
nli_tr
ydshieh/coco_dataset_script
mattymchen/refinedweb-3m
Dahoas/full-hh-rlhf
xquad
CollectiveCognition/chats-data-2023-09-27
large_spanish_corpus
Locutusque/hyperion-v2.0
TigerResearch/tigerbot-zhihu-zh-10k
1aurent/NCT-CRC-HE
1aurent/PatchCamelyon
quoref
gsarti/change_it
lucas-meyer/asr_af
iamplus/Conversational_Data
chizhikchi/CARES
paws
DFKI-SLT/few-nerd
eugenesiow/Set5
augmxnt/ultra-orca-boros-en-ja-v1
visual_genome
big_patent
bjoernp/ultrachat_de
Epiculous/Gnosis
tatoeba
LIUM/tedlium
IlyaGusev/ru_turbo_alpaca_evol_instruct
ConvLab/multiwoz21
open_subtitles
lucas-meyer/asr_xh
tiagoblima/qg_squad_v1_pt
deepset/germanquad
Dahoas/synthetic-instruct-gptj-pairwise
eugenesiow/Set14
eugenesiow/BSD100
eugenesiow/Urban100
bjoernp/tagesschau-2018-2023
common_language
CohereForAI/aya_collection
the_pile_books3
jondurbin/airoboros-gpt4-1.2
gigaword
pankajmathur/orca_mini_v1_dataset
togethercomputer/RedPajama-Data-V2
vivos
VMware/open-instruct-v1-oasst-dolly-hhrlhf
aeslc
HuggingFaceH4/CodeAlpaca_20K
Lin-Chen/ShareGPT4V
newsqa
mt_eng_vietnamese
amazon_us_reviews
mbruton/galician_srl
mbruton/spanish_srl
eli5_category
Norquinal/OpenCAI
tals/vitaminc
bigcode/commitpackft
THUDM/webglm-qa
cardiffnlp/tweet_topic_multi
blended_skill_talk
poloclub/diffusiondb
IlyaGusev/gazeta
snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
izumi-lab/llm-japanese-dataset
speech_commands
jondurbin/bagel-v0.3
PocketDoc/Floyd-Text-Adventures
winglian/evals
bigbio/med_qa
MBZUAI/Bactrian-X
cognitivecomputations/samantha-data
NeuralNovel/Neural-DPO
ResplendentAI/Synthetic_Soul_1k
RyokoAI/Fandom23K
milashkaarshif/MoeGirlPedia_wikitext_raw_archive
openbmb/llava_zh
liwu/MNBVC
tner/bc5cdr
bc2gm_corpus
EarthnDusk/Embeddings
speechcolab/gigaspeech
hkust-nlp/deita-6k-v0
BangumiBase/lapisrelights
allenai/scirepeval
lener_br
cosmos_qa
Muennighoff/P3
kunishou/oasst1-89k-ja
kakaobrain/coyo-700m
sbu_captions
Universal-NER/Pile-NER-type
conll2002
allenai/soda
Fredithefish/openassistant-guanaco-unfiltered
Severian/Biomimicry
arcd
Himitsui/Lewd-Assistant-v1
glaiveai/glaive-code-assistant-v2
PocketDoc/Choose-Your-Story-Long-Text-Adventures
adamo1139/AEZAKMI_v2
mlabonne/CodeLlama-2-20k
jondurbin/airoboros-gpt4-1.4
Abirate/english_quotes
crows_pairs
McGill-NLP/WebLINX
mozilla-foundation/common_voice_6_0
sms_spam
nomic-ai/gpt4all-j-prompt-generations
fmars/wiki_stem
Severian/Bio-Design-Process
Azure99/blossom-math-v2
Azure99/blossom-wizard-v1
Azure99/blossom-orca-v1
iapp_wiki_qa_squad
Locutusque/Hercules-v3.0
jerryjalapeno/nart-100k-synthetic
TokenBender/code_instructions_122k_alpaca_style
BelleGroup/train_1M_CN
jondurbin/airoboros-gpt4-1.3
nicholasKluge/instruct-aira-dataset
cardiffnlp/tweet_topic_single
NicolaiSivesind/human-vs-machine
gfissore/arxiv-abstracts-2021
rufimelo/PortugueseLegalSentences-v0
copenlu/fever_gold_evidence
McGill-NLP/WebLINX-full
linnaeus
teknium/trismegistus-project
frgfm/imagenette
tum-nlp/IDMGSP
CyberHarem/surtr_arknights
BangumiBase/seitokaiyakuindomo
yelp_polarity
Amod/mental_health_counseling_conversations
SetFit/bbc-news
oscar-corpus/OSCAR-2201
epfl-llm/guidelines
KnutJaegersberg/Auton
AyoubChLin/CNN_News_Articles_2011-2022
ajibawa-2023/Code-290k-ShareGPT
NobodyExistsOnTheInternet/full120k
HuggingFaceH4/deita-10k-v0-sft
yizhongw/self_instruct
Severian/Internal-Knowledge-Map
NobodyExistsOnTheInternet/GiftedConvoBeforeEcons
DILAB-HYU/KoQuality
BAAI/COIG-PC
vietgpt/wikipedia_vi
pn_summary
s3nh/alpaca-dolly-instruction-only-polish
hotpot_qa
caner
michelecafagna26/hl
edinburghcstr/ami
iamplus/Orca
hltcoe/tdist-msmarco-scores
wmt20_mlqe_task1
tasksource/mmlu
poem_sentiment
jnlpba
starfishmedical/webGPT_x_dolly
Babelscape/multinerd
euirim/goodwiki
pankajmathur/WizardLM_Orca
Thaweewat/alpaca-cleaned-52k-th
sem_eval_2018_task_1
totally-not-an-llm/EverythingLM-data-V2
knowrohit07/saraswati-stem
meta-math/MetaMathQA-40K
icybee/share_gpt_90k_v1
mlabonne/chatml-OpenHermes2.5-dpo-binarized-alpha
fashion_mnist
nickrosh/Evol-Instruct-Code-80k-v1
arxiv_dataset
OpenLeecher/Teatime
indonli
sberquad
SkelterLabsInc/JaQuAD
commanderstrife/jnlpba
chintagunta85/ncbi_disease
lj_speech
sentiment140
medalpaca/medical_meadow_wikidoc
BangumiBase/fatestaynightufotable
quora
wiki_qa
fquad
conll2012_ontonotesv5
tapaco
arabic_billion_words
hate_speech_filipino
opus_infopankki
truthful_qa
trec
silicone
squad_v1_pt
lince
tab_fact
emo
circa
cc_news
dbrd
lst20
germeval_14
thaisum
muchocine
squad_kor_v1
consumer-finance-complaints
arabic_speech_corpus
quail
math_qa
ethos
hate_speech18
wiki_atomic_edits
web_questions
wisesight_sentiment
msr_sqa
allocine
svhn
snow_simplified_japanese_corpus
srwac
tiny_shakespeare
ehealth_kd
id_nergrit_corpus
google_wellformed_query
social_i_qa
tweets_hate_speech_detection
health_fact
ade_corpus_v2
nq_open
multi_nli_mismatch
harem
id_liputan6
winograd_wsc
dutch_social
conllpp
conll2000
dream
art
app_reviews
acronym_identification
openai_humaneval
dane
discofuse
bigscience/xP3mt
iwslt2017
wiki40b
code_x_glue_ct_code_to_text
hatexplain
brwac
empathetic_dialogues
cmrc2018
code_x_glue_cc_defect_detection
multi_woz_v22
aslg_pc12
bible_para
fake_news_english
gem
wider_face
sick
snips_built_in_intents
liar
wiki_dpr
monash_tsf
wi_locness
wiki_split
para_crawl
textvqa
xglue
wili_2018
scb_mt_enth_2020
docred
gap
thaiqa_squad
narrativeqa
germaner
the_pile_openwebtext2
reddit_tifu
wiki_auto
medical_questions_pairs
cos_e
head_qa
discovery
species_800
quarel
sem_eval_2010_task_8
norne
wongnai_reviews
conceptual_12m
break_data
cedr
id_newspapers_2018
mozilla-foundation/common_voice_6_1
kilt_tasks
kor_nlu
pib
conv_ai_2
hrwac
setimes
climate_fever
hard
peoples_daily_ner
orange_sum
quac
lc_quad
quickdraw
hans
hlgd
wiki_hop
wiqa
codah
definite_pronoun_resolution
hope_edi
pragmeval
humicroedit
has_part
blog_authorship_corpus
mc_taco
numer_sense
onestop_qa
generated_reviews_enth
gnad10
kor_3i4k
cornell_movie_dialog
ted_multi
mkqa
lccc
kd_conv
mozilla-foundation/common_voice_4_0
parsinlu_reading_comprehension
web_nlg
nchlt
com_qa
kor_nli
catalonia_independence
web_of_science
tamilmixsentiment
jeopardy
reuters21578
labr
ted_talks_iwslt
crd3
scielo
clue
newsgroup
sst
nlu_evaluation_data
mozilla-foundation/common_voice_3_0
hind_encorp
giga_fren
movie_rationales
id_clickbait
doc2dial
craigslist_bargains
mozilla-foundation/common_voice_14_0
sharc_modified
mocha
emotone_ar
code_x_glue_cc_code_completion_line
igbo_english_machine_translation
opus_wikipedia
ronec
ptb_text_only
turku_ner_corpus
xor_tydi_qa
miam
offenseval_dravidian
xcsr
xcopa
flue
best2009
totto
event2Mind
tlc
imdb_urdu_reviews
cmu_hinglish_dog
bsd_ja_en
hybrid_qa
offenseval2020_tr
kor_hate
turkic_xwmt
medal
newsph_nli
prachathai67k
thai_toxicity_tweet
msra_ner
proto_qa
sede
red_caps
disfl_qa
electricity_load_diagrams
tsac
id_panl_bppt
hyperpartisan_news_detection
reclor
conv_ai
conv_ai_3
ted_iwlst2013
eurlex
menyo20k_mt
told-br
hausa_voa_ner
code_x_glue_tc_text_to_code
xsum_factuality
tweet_qa
opus_dgt
yahoo_answers_qa
hate_speech_portuguese
xquad_r
emea
para_pat
nkjp-ner
mozilla-foundation/common_voice_1_0
squad_adversarial
math_dataset
squad_kor_v2
medical_dialog
makhzan
spanish_billion_words
ubuntu_dialogs_corpus
wiki_bio
kelm
recipe_nlg
time_dial
reasoning_bg
Apply filters
Models
2
new
Full-text search
Edit filters
Sort: Trending
Active filters:
wit
Clear all
clip-italian/clip-italian-final
Updated
Jul 18, 2021
•
5
clip-italian/clip-italian
Feature Extraction
•
Updated
Mar 16, 2023
•
359
•
13