Languages
This table displays the number of mono-lingual (or "few"-lingual, with "few" arbitrarily set to 5 or less) models and datasets, by language. You can click on the figures on the right to the lists of actual models and datasets.
Multilingual models are listed here, while multilingual datasets are listed there.
Language | ISO code | Datasets | Models |
---|---|---|---|
English English | en
|
2,294 | 12,251 |
Spanish Español | es
|
265 | 1,034 |
French Français | fr
|
254 | 1,069 |
German Deutsch | de
|
234 | 825 |
Russian Русский | ru
|
208 | 560 |
Portuguese Português | pt
|
197 | 560 |
Chinese 中文 | zh
|
195 | 867 |
Arabic اللغة العربية | ar
|
173 | 523 |
Italian Italiano | it
|
164 | 493 |
Polish język polski | pl
|
147 | 274 |
Dutch Nederlands | nl
|
145 | 382 |
Hindi हिन्दी | hi
|
140 | 403 |
Korean 한국어 | ko
|
139 | 376 |
Indonesian Bahasa Indonesia | id
|
138 | 344 |
Japanese 日本語 | ja
|
131 | 599 |
Swedish Svenska | sv
|
130 | 448 |
Turkish Türkçe | tr
|
127 | 377 |
Finnish suomi | fi
|
125 | 399 |
Catalan Català | ca
|
124 | 258 |
Bengali বাংলা | bn
|
120 | 232 |
Danish dansk | da
|
118 | 227 |
Romanian Română | ro
|
118 | 301 |
Czech čeština | cs
|
115 | 200 |
Tamil தமிழ் | ta
|
110 | 217 |
Vietnamese Tiếng Việt | vi
|
106 | 373 |
Slovenian slovenski jezik | sl
|
105 | 180 |
Greek Ελληνικά | el
|
104 | 213 |
Thai ไทย | th
|
102 | 252 |
Hungarian magyar | hu
|
101 | 195 |
Telugu తెలుగు | te
|
101 | 187 |
Estonian eesti | et
|
94 | 227 |
Persian فارسی | fa
|
92 | 280 |
Urdu اردو | ur
|
92 | 208 |
Malayalam മലയാളം | ml
|
91 | 173 |
Bulgarian български език | bg
|
90 | 199 |
Latvian latviešu valoda | lv
|
86 | 133 |
Marathi मराठी | mr
|
86 | 211 |
Lithuanian lietuvių kalba | lt
|
83 | 159 |
Slovak slovenčina | sk
|
81 | 130 |
Swahili Kiswahili | sw
|
81 | 210 |
Croatian Hrvatski | hr
|
80 | 157 |
Kannada ಕನ್ನಡ | kn
|
78 | 144 |
Gujarati ગુજરાતી | gu
|
77 | 174 |
Basque euskara | eu
|
76 | 159 |
Panjabi ਪੰਜਾਬੀ | pa
|
76 | 153 |
Ukrainian Українська | uk
|
75 | 323 |
Maltese Malti | mt
|
73 | 85 |
Hebrew עברית | he
|
71 | 147 |
Icelandic Íslenska | is
|
70 | 184 |
Norwegian Norsk | no
|
70 | 192 |
Oriya ଓଡ଼ିଆ | or
|
65 | 124 |
Serbian српски језик | sr
|
65 | 139 |
Irish Gaeilge | ga
|
64 | 108 |
Sinhala සිංහල | si
|
63 | 126 |
Afrikaans Afrikaans | af
|
62 | 137 |
Assamese অসমীয়া | as
|
58 | 146 |
Nepali नेपाली | ne
|
56 | 152 |
Amharic አማርኛ | am
|
55 | 108 |
Welsh Cymraeg | cy
|
55 | 125 |
Malay Bahasa Malaysia | ms
|
55 | 125 |
Burmese ဗမာစာ | my
|
55 | 104 |
Tagalog Wikang Tagalog | tl
|
55 | 107 |
Yoruba Yorùbá | yo
|
55 | 121 |
Armenian Հայերեն | hy
|
53 | 112 |
Georgian ქართული | ka
|
53 | 122 |
Galician galego | gl
|
52 | 132 |
Belarusian беларуская мова | be
|
51 | 132 |
Khmer ខេមរភាសា | km
|
51 | 86 |
Azerbaijani azərbaycan dili | az
|
50 | 106 |
Albanian Shqip | sq
|
50 | 98 |
Esperanto Esperanto | eo
|
48 | 123 |
Kazakh қазақ тілі | kk
|
48 | 100 |
Uzbek Ўзбек | uz
|
48 | 94 |
Breton brezhoneg | br
|
47 | 79 |
Igbo Asụsụ Igbo | ig
|
47 | 102 |
Kinyarwanda Ikinyarwanda | rw
|
47 | 84 |
Hausa هَوُسَ | ha
|
46 | 111 |
Kyrgyz Кыргызча | ky
|
46 | 81 |
Macedonian македонски јазик | mk
|
46 | 126 |
Latin latine | la
|
45 | 82 |
Norwegian Nynorsk Norsk nynorsk | nn
|
45 | 70 |
Somali Soomaaliga | so
|
45 | 96 |
Kurdish Kurdî | ku
|
44 | 61 |
Mongolian Монгол хэл | mn
|
44 | 128 |
Xhosa isiXhosa | xh
|
44 | 121 |
Pashto پښتو | ps
|
43 | 95 |
Zulu isiZulu | zu
|
43 | 78 |
Cebuano | ceb
|
43 | 55 |
Tajik тоҷикӣ | tg
|
42 | 49 |
Western Frisian Frysk | fy
|
41 | 73 |
Tatar татар теле | tt
|
41 | 44 |
Luxembourgish Lëtzebuergesch | lb
|
40 | 74 |
Norwegian Bokmål Norsk bokmål | nb
|
40 | 62 |
Scottish Gaelic Gàidhlig | gd
|
39 | 75 |
Javanese basa Jawa | jv
|
39 | 87 |
Bosnian bosanski jezik | bs
|
38 | 72 |
Sanskrit संस्कृतम् | sa
|
38 | 60 |
Uyghur ئۇيغۇرچە | ug
|
38 | 43 |
Yiddish ייִדיש | yi
|
38 | 76 |
Lao ພາສາ | lo
|
37 | 76 |
Haitian Kreyòl ayisyen | ht
|
36 | 80 |
Ganda Luganda | lg
|
36 | 76 |
Occitan occitan | oc
|
36 | 46 |
Sindhi सिन्धी | sd
|
36 | 73 |
Wolof Wollof | wo
|
36 | 62 |
Tibetan བོད་ཡིག | bo
|
34 | 27 |
Faroese føroyskt | fo
|
34 | 41 |
Malagasy fiteny malagasy | mg
|
34 | 80 |
Pedi | nso
|
34 | 53 |
Divehi Dhivehi | dv
|
33 | 20 |
Quechua Runa Simi | qu
|
33 | 9 |
Turkmen Türkmen | tk
|
33 | 27 |
Tswana Setswana | tn
|
33 | 59 |
Interlingua Interlingua | ia
|
32 | 12 |
Shona chiShona | sn
|
32 | 89 |
Chuvash чӑваш чӗлхи | cv
|
31 | 22 |
Chichewa chiCheŵa | ny
|
31 | 77 |
Southern Sotho Sesotho | st
|
31 | 69 |
Bambara bamanankan | bm
|
30 | 40 |
Guaraní Avañe'ẽ | gn
|
30 | 27 |
Romansh rumantsch grischun | rm
|
30 | 18 |
Sundanese Basa Sunda | su
|
30 | 86 |
Māori te reo Māori | mi
|
29 | 41 |
Tsonga Xitsonga | ts
|
29 | 56 |
Waray | war
|
29 | 28 |
Yakut саха тыла | sah
|
29 | 14 |
Bashkir башҡорт теле | ba
|
28 | 43 |
Ido Ido | io
|
28 | 11 |
Lingala Lingála | ln
|
28 | 76 |
Kirundi Ikirundi | rn
|
27 | 64 |
Twi Twi | tw
|
27 | 51 |
Walloon walon | wa
|
27 | 12 |
Yue Chinese | yue
|
27 | 28 |
Aragonese aragonés | an
|
26 | 20 |
Limburgish Limburgs | li
|
26 | 11 |
Volapük Volapük | vo
|
26 | 8 |
Chechen нохчийн мотт | ce
|
25 | 16 |
Ossetian ирон æвзаг | os
|
25 | 10 |
Iloko | ilo
|
25 | 24 |
Akan Akan | ak
|
24 | 38 |
Oromo Afaan Oromoo | om
|
24 | 38 |
Tigrinya ትግርኛ | ti
|
24 | 34 |
Manx Gaelg | gv
|
23 | 14 |
Cornish Kernewek | kw
|
23 | 8 |
Chamorro Chamoru | ch
|
22 | 11 |
Corsican corsu | co
|
22 | 30 |
Kikuyu Gĩkũyũ | ki
|
22 | 37 |
Venda Tshivenḓa | ve
|
22 | 6 |
Kabyle | kab
|
22 | 12 |
Sardinian sardu | sc
|
21 | 12 |
Northern Sami Davvisámegiella | se
|
21 | 11 |
Fon | fon
|
21 | 64 |
Abkhaz аҧсуа бызшәа | ab
|
20 | 30 |
Aymara aymar aru | ay
|
20 | 1 |
Bislama Bislama | bi
|
20 | 10 |
Ewe Eʋegbe | ee
|
20 | 25 |
Interlingue Interlingue | ie
|
20 | 1 |
Samoan gagana fa'a Samoa | sm
|
20 | 36 |
Swati SiSwati | ss
|
20 | 15 |
Tok Pisin | tpi
|
20 | 13 |
Bihari भोजपुरी | bh
|
19 | 4 |
Dzongkha རྫོང་ཁ | dz
|
19 | 8 |
Inuktitut ᐃᓄᒃᑎᑐᑦ | iu
|
19 | - |
Navajo Diné bizaad | nv
|
19 | 3 |
Pangasinan | pag
|
19 | 18 |
Papiamento | pap
|
19 | 18 |
Avaric авар мацӀ | av
|
18 | 2 |
Fula Fulfulde | ff
|
18 | 7 |
Kongo Kikongo | kg
|
18 | 17 |
Kalaallisut kalaallisut | kl
|
18 | 3 |
Komi коми кыв | kv
|
18 | 1 |
Tonga faka Tonga | to
|
18 | 14 |
Tumbuka | tum
|
18 | 44 |
Umbundu | umb
|
18 | 9 |
Luo | luo
|
18 | 12 |
Old Church Slavonic ѩзыкъ словѣньскъ | cu
|
17 | 2 |
Fijian Vakaviti | fj
|
17 | 17 |
Kashmiri कश्मीरी | ks
|
17 | 6 |
Sango yângâ tî sängö | sg
|
17 | 21 |
Tetun Dili | tdt
|
17 | 1 |
Kanuri Kanuri | kr
|
16 | 8 |
Marshallese Kajin M̧ajeļ | mh
|
16 | 11 |
Circassian Kabardian Адыгэбзэ КIахэ | kbd
|
16 | 7 |
Afar Afaraf | aa
|
15 | 4 |
Cree ᓀᐦᐃᔭᐍᐏᐣ | cr
|
15 | 5 |
Nuosu ꆈꌠ꒿ Nuosuhxop | ii
|
15 | - |
Pāli पाऴि | pi
|
15 | - |
Tahitian Reo Tahiti | ty
|
15 | 13 |
Zhuang Saɯ cueŋƅ | za
|
15 | - |
Hiligaynon | hil
|
15 | 9 |
Sranan Tongo | srn
|
15 | 10 |
Mossi | mos
|
15 | 34 |
Hiri Motu Hiri Motu | ho
|
14 | 7 |
Inupiaq Iñupiaq | ik
|
14 | - |
Kwanyama Kuanyama | kj
|
14 | 2 |
Nauru Ekakairũ Naoero | na
|
14 | 4 |
Ndonga Owambo | ng
|
14 | 2 |
Ojibwe ᐊᓂᔑᓈᐯᒧᐎᓐ | oj
|
14 | - |
Central Bikol | bcl
|
14 | 14 |
Pijin | pis
|
14 | 11 |
Berber languages | ber
|
14 | 6 |
Southern Ndebele isiNdebele | nr
|
13 | 1 |
American Sign Language | ase
|
13 | 9 |
Bemba | bem
|
13 | 15 |
Niuean | niu
|
12 | 12 |
Pohnpeian | pon
|
12 | 11 |
Tonga (Zambia) | toi
|
12 | 8 |
Yapese | yap
|
12 | 6 |
Wolaitta | wal
|
12 | 1 |
Circassian Adyghean Адыгэбзэ Къэбэрдей | ady
|
12 | - |
Avestan avesta | ae
|
11 | 1 |
Herero Otjiherero | hz
|
11 | - |
Luba-Katanga Tshiluba | lu
|
11 | 10 |
Northern Ndebele isiNdebele | nd
|
11 | - |
Seselwa Creole French | crs
|
11 | 12 |
Luba-Lulua | lua
|
11 | 17 |
Lushai | lus
|
11 | 15 |
Congo Swahili | swc
|
11 | 10 |
Tuvalua | tvl
|
11 | 10 |
Tiv | tiv
|
11 | 7 |
Chuukese | chk
|
11 | 6 |
Morisyen | mfe
|
11 | 6 |
Romance languages | roa
|
11 | 12 |
Tzotzil | tzo
|
11 | 2 |
Isthmus Zapotec | zai
|
11 | 2 |
Yucateco | yua
|
11 | 1 |
Ga | gaa
|
10 | 12 |
Efik | efi
|
10 | 11 |
Gilbertese | gil
|
10 | 11 |
Lozi | loz
|
10 | 10 |
Tetela | tll
|
10 | 10 |
Luvale | lue
|
10 | 9 |
Kaonde | kqn
|
10 | 8 |
Rundi | run
|
10 | 7 |
Wallisian | wls
|
10 | 7 |
Zande (individual language) | zne
|
10 | 7 |
San Salvador Kongo | kwy
|
10 | 6 |
Ruund | rnd
|
10 | 6 |
Lunda | lun
|
10 | 2 |
Nyaneka | nyk
|
10 | 2 |
Peruvian Sign Language | prl
|
10 | 2 |
Spanish Sign Language | ssp
|
10 | 1 |
Venezuelan Sign Language | vsl
|
10 | 1 |
Acoli | ach
|
9 | 1 |
Gun | guw
|
9 | 12 |
Brazilian Sign Language | bzs
|
9 | 11 |
Isoko | iso
|
9 | 11 |
Argentine Sign Language | aed
|
9 | 2 |
Chilean Sign Language | csg
|
9 | 2 |
Colombian Sign Language | csn
|
9 | 2 |
Kwangali | kwn
|
9 | 2 |
Mexican Sign Language | mfs
|
9 | 2 |
Finnish Sign Language | fse
|
9 | 2 |
Celtic languages | cel
|
9 | 4 |
You can check the data source for the ISO 639 codes in this repo. Contributions are welcome!