ai-forever
/

RuM2M100-418M

Text2Text Generation

natural language generation

Inference Endpoints

Model card Files Files and versions Community

ai-forever commited on Aug 31, 2023

Commit

047e891

•

1 Parent(s): 60dd1c8

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ tags:
 ### Summary
 The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
 The proofreader was trained based on the [M2M100-418M](https://huggingface.co/facebook/m2m100_418M) model.
-An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE library](https://github.com /orgs/ai-forever/sage).
 ### Public references
 - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023

 ### Summary
 The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
 The proofreader was trained based on the [M2M100-418M](https://huggingface.co/facebook/m2m100_418M) model.
+An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE library](https://github.com/orgs/ai-forever/sage).
 ### Public references
 - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023