Hugging Face
Models
Datasets
Pricing
Resources
Website
Metrics
Languages
Organizations
Community
Forum
Blog
GitHub
Documentation
Model Hub doc
Inference API doc
Transformers doc
Tokenizers doc
Datasets doc
We're hiring!
Log In
Sign Up
Account
Log In
Sign Up
Website
Models
Datasets
Metrics
Languages
Organizations
Pricing
Community
Forum
Blog
Documentation
Model Hub doc
Inference API doc
Transformers doc
Tokenizers doc
Datasets doc
Back to tag list
Tasks
Clear
Fill-Mask
Question Answering
Summarization
Table Question Answering
Text Classification
Text Generation
Text2Text Generation
Token Classification
Translation
Zero-Shot Classification
Conversational
Text-to-Speech
Automatic Speech Recognition
Audio Source Separation
Voice Activity Detection
+ 5
Back to tag list
Libraries
Clear All
PyTorch
TensorFlow
Rust
Flair
Asteroid
TF SavedModel
ESPnet
TF Lite
Pyannote
Timm
ONNX
+ 9
Back to tag list
Datasets
Clear All
wikipedia
common_voice
squad
bookcorpus
c4
CLUECorpusSmall
dcep europarl jrc-acquis
parsinlu
oscar
squad_v2
cnn_dailymail
imagenet
conll2003
librispeech_asr
PropBank.Br
jrc-acquis
xsum
OSIAN
1.5B Arabic Corpus
gigaword
natural_questions
imagenet-21k
ontonotes
multi_nli
OSCAR Arabic Unshuffled
brWaC
wikisql
mustc
openslr
CoNLL-2012
lince
Indo4B
snli
code_search_net
OPUS
wmt19
sep_clean
mnli
xnli
blended_skill_talk
OpenLegalData
wtq
twitter
tab_fact
msr_sqa
librispeech
enh_single
ai-soco
mc4
common_crawl
xtreme
fever
trivia_qa
race
imdb
arabic_billion_words
open_subtitles
samsum
Libri1Mix
sep_noisy
openwebtext
flaubert
DAGW
piaf
emotion
cc100
ag_news
OpenSLR
quoref
docred
gap
winograd_wsc
winogender
glue
squad2
SAIL 2017
biomedical literature from Scielo and Pubmed
w11wo/imdb-javanese
openbookqa
Libri2Mix
Libri3Mix
web_questions
wiki_dpr
FQuAD
SQuAD-FR
cc_news
PubMed
id_liputan6
reddit singapore, malaysia
hardwarezone
interspeech_2021_asr
ms_marco
array of dataset identifiers
opus100
jsut
wmt16
ComVE
voxceleb
dihard
wham
Universal Dependencies
commonsenseqa
arc
qqp
the Pile
id_newspapers_2018
STSbenchmark
BFD
dindebat.dk
hestenettet.dk
danish OpenSubtitles
AI4Bharat IndicNLP Corpora
anli
mlqa
MIMIC-III
Wikipedia
go_emotions
tydiqa
pubmed
arabic_speech_corpus
fquad
scientific_papers
NQ
Trivia
SQuAD
MLQA
DRCD
Indonesian Wikipedia
arcd
common_gen
germeval_14
MS MARCO document ranking
wikipedia-turkish
mulit_nli
wmt14
MNLI
wer
TQUAD
timit_asr
indosum
NST Swedish ASR Database
EMBO/sd-panels
CSS10
sts
scancode-rules
trec
Twitter
IndianPolitics
conll2000
ljspeech
muchocine
break_data
https://arabicspeech.org/
sst-2
Shuffled Dutch section of the OSCAR corpus (https://oscar-corpus.com/)
msmarco
yahoo-answers
Uniref100
SciDocs
220M words (IndoWiki, IndoWC, News)
squad_v1
nadi
eli5
parlament_parla
Marefa-NER
multi_nli_mismatch
CommonCrawl
triviaqa
coqa
mlsum
Wikihow
Jean-Baptiste/wikiner_fr
UniRef50
Arabic poetry from several eras
tweets_hate_speech_detection
Interspeech 2021
xsum_nl
xquad
discofuse
webqa
dureader
emo
bioASQ
sail
yelp_polarity
Squad
XQuad
Tydiqa
susumu2357/squad_v2_sv
movies
cord19
arxiv_dataset
quora
100GB Chinese corpus
vivos
custom-book-corpus
squad_v1_pt
quartz
Spotify Podcasts Dataset
sms_spam
wiki-mk
time-mk-news-2010-2015
legal entity recognition
masakhaner
Indic TTS Malayalam Speech Corpus
Openslr Malayalam Speech Corpus
SMC Malayalam Speech Corpus
IIIT-H Indic Speech Databases
shemo
created a new dataset based on https://www.openslr.org/92/
common_voice, infore_25h
Wikipedia (Hindi, Sanskrit, Gujarati)
google_wellformed_query
cifar10
Arabic Wikipedia
marefa-mt
socian
bangla-sentiment-benchmark
LJSpeech
LibriTTS
RuSentiment
MLSUM
CC-aligned
quotes-500K
EMBO/sd-nlp
bible_para
L3CubeMahaSent
common_voice mn
codexglue
XSUM
Gigaword
ALFFA,Gamayun & IWSLT
google
Oscar Corpus, News, Stories
event2Mind
ai2_arc
BembaSpeech
Icelandic portion of the OSCAR corpus from INRIA
JW300 + [Menyo-20k](https://huggingface.co/datasets/menyo20k_mt)
EMBO/biolang
augmented_codesearchnet
pytorrent
HARD-Arabic-Dataset
JW300
Finnish parliament session 2
CC100
kazakh_speech_corpus
custom danish dataset
https://github.com/wangcunxiang/SemEval2020-Task4-Commonsense-Validation-and-Explanation
coco
RuTweetCorp
RuReviews
fon_dataset
https://github.com/staeiou/arxiv_archive/tree/v1.0.1
Tesserae
Phi5
Thomas Aquinas
algebra_linear_1d
algebra_linear_1d_composed
measurement_time
numbers_gcd
kowiki
news
OpenSLR 77
DaNE
legal
DAMP-VSEP
tatoeba
setimes
csmsc
SUC 3.0
sqa
libri1mix
qasc
quarel
mgb5
TACDataset
ami
voxconverse
Farasa
openslr_hindi
urdu-text-news
Voicebank
DEMAND
WHAM!
WHAMR!
WSJ0-2Mix
WSJ0-3Mix
Timers and Such
wikimovies
imagenet_21k
mlsum - es
Yves/fhnw_swiss_parliament
+ 285
Back to tag list
Languages
Clear All
en
es
fr
sv
de
fi
multilingual
zh
ru
ar
fa
it
id
pt
tr
nl
uk
eo
ja
pl
da
Chinese
bg
ro
he
el
hi
no
cs
lt
ca
af
et
vi
hu
sl
is
ms
ko
ht
hr
mr
tl
bn
mt
lv
gl
gu
mk
eu
ig
rw
ur
lg
ny
or
sn
xh
ee
ts
ln
yo
as
si
mn
rn
ga
be
jv
sm
ta
ty
to
nso
fy
ha
lb
sq
te
yi
nb
fj
nn
gaa
bcl
crs
guw
tn
niu
co
wa
ceb
cy
ka
st
br
mh
fo
ilo
bzs
iso
efi
pap
pon
pis
gil
lua
pag
rm
oc
an
am
hy
sk
th
zu
ti
tw
kg
bem
swc
tll
tvl
lus
loz
ml
english
gv
ase
bi
war
lu
hil
lue
gd
km
kn
so
os
ps
se
kqn
srn
toi
mg
tt
wo
kw
ho
tiv
wls
zne
run
tpi
ne
az
cv
kwy
ber
chk
tum
mfe
sc
yap
rnd
ve
c++
mi
sw
tk
dv
yue
mos
sh
roa
my
code
protein
ky
lo
su
na
ba
sa
pa-IN
fy-NL
kk
bo
bs
pa
sr
sla
kl
io
ce
ab
ISO 639-1 code for your language, or `multilingual`
gem
luo
sv-SE
fiu
gmq
itc
umb
zls
gmw
cel
zle
iir
afa
sem
urj
cpp
ine
inc
zlw
French
sah
hsb
la
eng
jap
ch
gn
nv
mul
zh-tw
lzh
py
grk
ga-IE
kj
trk
bat
phi
ss
om
euq
dra
ng
nyk
fse
kwn
bnt
lun
aav
alv
csg
pqe
csn
aed
cpf
cus
mkh
nic
sal
mfs
prl
tzo
zai
Cszech
Deustch
Swedish
ia
rm-sursilv
cnr
hbs
haw
hmn
ku
tg
ug
uz
vn
dutch
italian
scientific english
???
ks
sd
hi-en
[en]
amh
hau
ibo
kin
lug
pcm
swa
wol
yor
nah specifically ncj
zh-HK
Guj
tut
cau
kab
ssp
scn
nap
taw
Deustch English
scandinavia
art
ccs
map
poz
pqw
sit
tdt
yua
sami
vsl
wal
ach
Cszech Deustch
Cszech English
Cszech Spanish
Cszech French
Cszech Italian
Cszech Swedish
Deustch Cszech
Deustch Spanish
Deustch French
Deustch Italian
Deustch Swedish
English Cszech
English Deustch
English Italian
French Cszech
French Deustch
French English
French Spanish
French Italian
French Swedish
Italian Cszech
Italian Deustch
Italian English
Italian Spanish
Italian French
Italian Swedish
Swedish Cszech
Swedish Deustch
Swedish English
Swedish Spanish
Swedish French
Swedish Italian
rm-vallader
fon
Go
Java
javascript
php
python
en de nl es
cnh
nr
sot
ven
xho
zul
arz
ary
esperanto
xal
+ 357
Back to tag list
Licenses
Clear All
apache-2.0
mit
cc-by-sa-3.0
cc-by-4.0
cc by-nc-sa 4.0
gpl-3.0
apache 2.0
cc-by-sa-4.0
cc-by-nc-4.0
public domain notice
cc-by 4.0
any valid license identifier
apache-2
apache license 2.0
apache
attribution-sharealike 4.0 international
gnu gplv3
cc0
+ 16
Models
6
Sort:
Most Downloads
Most Downloads
Alphabetical
Recently Updated
Helsinki-NLP/opus-mt-chk-en
Translation
•
Updated
Jan 18
•
49
Helsinki-NLP/opus-mt-en-chk
Translation
•
Updated
Jan 18
•
37
Helsinki-NLP/opus-mt-chk-es
Translation
•
Updated
Jan 18
•
36
Helsinki-NLP/opus-mt-chk-sv
Translation
•
Updated
Jan 18
•
33
Helsinki-NLP/opus-mt-chk-fr
Translation
•
Updated
Jan 18
•
32
Helsinki-NLP/opus-mt-sv-chk
Translation
•
Updated
Aug 21, 2020
•
29