ai-forever committed
Commit • f80d0a7 • Parent(s): 39a25d0
Update README.md

README.md CHANGED
@@ -17,10 +17,12 @@ The model corrects spelling errors and typos by bringing all the words in the te
 Corrector was trained based on the model [M2M100-1.2B](https://huggingface.co/facebook/m2m100_1.2B).
 An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the library [SAGE](https://github.com/orgs/ai-forever/sage).
 
-###
-- [
-- [
-- [
+### Public references
+- [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
+- [Paper about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
+- [Paper about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
+- [Path_to_model](https://huggingface.co/ai-forever/RuM2M100-1.2B)
+
 
 ### Examples
 | Input | Output |

@@ -104,7 +106,7 @@ print(answer)
 ```
 
 ## Resources
-- [SAGE library
+- [SAGE library](https://github.com/orgs/ai-forever/sage), GitHub
 - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
 - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
 - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace

@@ -112,7 +114,7 @@ print(answer)
 
 ## License
 Model [M2M100-1.2B](https://huggingface.co/facebook/m2m100_1.2B), on the basis of which our solution is made, and its source code are supplied under the MIT open license.
-Our solution also comes with
+Our solution also comes with MIT license.
 
 ## Specifications
 - File size: 5 Gb;

@@ -122,4 +124,4 @@ Our solution also comes with an MIT license.
 - Developer: SberDevices, AGI NLP
 
 ## Contacts
-
+nikita.martynov.98@list.ru
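The README describes a training corpus built by automatically injecting typos and spelling errors into clean text with the SAGE library. A minimal toy sketch of that idea is below; this is not SAGE's API — the function name and the three corruption operations are invented for illustration, and SAGE itself uses far richer, statistically grounded corruption strategies:

```python
import random

def introduce_typos(text: str, rate: float = 0.1, seed: int = 0) -> str:
    """Toy synthetic-error generator: randomly delete, duplicate,
    or swap adjacent characters at the given per-character rate.
    Hypothetical illustration only, not the SAGE library's interface."""
    rng = random.Random(seed)  # seeded so corpora are reproducible
    chars = list(text)
    out = []
    i = 0
    while i < len(chars):
        c = chars[i]
        if c.isalpha() and rng.random() < rate:
            op = rng.choice(["delete", "duplicate", "swap"])
            if op == "delete":
                i += 1          # drop this character entirely
                continue
            if op == "swap" and i + 1 < len(chars):
                out.append(chars[i + 1])  # transpose with the next char
                out.append(c)
                i += 2
                continue
            if op == "duplicate":
                out.append(c)   # emit the character twice
            out.append(c)
        else:
            out.append(c)
        i += 1
    return "".join(out)

clean = "привет, как дела"
noisy = introduce_typos(clean, rate=0.3, seed=42)
print(noisy)  # a corrupted variant of the clean sentence
```

Pairs of (noisy, clean) sentences produced this way form the kind of parallel corpus on which a sequence-to-sequence corrector such as this M2M100-based model can be trained.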