ai-forever committed
Commit • cc20058
1 Parent(s): 0fb1ca9
Update README.md
README.md CHANGED
@@ -87,18 +87,42 @@ We reproduce the GPT-3 architecture using GPT-2 sources and the sparse attention
The source code for the mGPT XL model is available on [Github](https://github.com/sberbank-ai/mgpt)

## Paper
mGPT: Few-Shot Learners Go Multilingual

[Abstract](https://arxiv.org/abs/2204.07580) [PDF](https://arxiv.org/pdf/2204.07580.pdf)

![](https://habrastorage.org/webt/1q/ru/yt/1qruytul6m2m-upyk9frq3pgrds.png)
```
@misc{https://doi.org/10.48550/arxiv.2204.07580,
  doi = {10.48550/ARXIV.2204.07580},
  url = {https://arxiv.org/abs/2204.07580},
  author = {Shliazhko, Oleh and Fenogenova, Alena and Tikhonova, Maria and Mikhailov, Vladislav and Kozlova, Anastasia and Shavrina, Tatiana},
  keywords = {Computation and Language (cs.CL), Artificial Intelligence (cs.AI), FOS: Computer and information sciences, I.2; I.2.7, 68-06, 68-04, 68T50, 68T01},
  title = {mGPT: Few-Shot Learners Go Multilingual},
  publisher = {arXiv},
  year = {2022},
  copyright = {Creative Commons Attribution 4.0 International}
}
```

## Languages

Model supports 60 languages:

ISO codes:
```az, sw, af, ar, ba, be, bxr, bg, bn, cv, hy, da, de, el, es, eu, fa, fi, fr, he, hi, hu, kk, id, it, ja, ka, ky, ko, lt, lv, mn, ml, os, mr, ms, my, nl, ro, pl, pt, sah, ru, tg, sv, ta, te, tk, th, tr, tl, tt, tyv, uk, en, ur, vi, uz, yo, zh, xal```

Languages:
```Afrikaans, Azerbaijani, Belarusian, Bengali, Chuvash, German, English, Basque, Finnish, Hebrew (modern), Hungarian, Indonesian, Japanese, Kazakh, Kyrgyz, Latvian, Mongolian, Malay, Dutch, Polish, Romanian, Moldovan, Yakut, Swahili, Telugu, Thai, Turkish, Tuvinian, Urdu, Vietnamese, Yoruba, Arabic, Bashkir, Bulgarian, Buriat, Danish, Greek (modern), Spanish (Castilian), Persian, French, Hindi, Armenian, Italian, Georgian, Korean, Lithuanian, Malayalam, Marathi, Burmese, Ossetian, Portuguese, Russian, Swedish, Tamil, Tajik, Turkmen, Tatar, Ukrainian, Uzbek, Kalmyk, Chinese```
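
To show how this language coverage is typically exercised, here is a minimal generation sketch that is not part of the original card: it assumes the checkpoint is published under the `ai-forever/mGPT` identifier and follows the standard `transformers` causal-LM API, so adjust the model id if you use a different mGPT checkpoint.

```python
# Minimal sketch, not from the original card: the model id and sampling settings
# below are assumptions; swap in the checkpoint you actually want to query.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ai-forever/mGPT"  # assumed Hugging Face identifier for this card
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Prompts in two of the supported languages (en and ru from the ISO list above).
prompts = [
    "The Moon is the only natural satellite of the Earth.",
    "Луна является единственным естественным спутником Земли.",
]

for prompt in prompts:
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=40, do_sample=True, top_p=0.95)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The sampling parameters are illustrative only; greedy decoding or beam search work the same way through `generate`.
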
## Training Data Statistics