ai-forever commited on
Commit
43d8da0
1 Parent(s): 09969eb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -15,12 +15,13 @@ tags:
15
  ### Summary
16
  The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
17
  The proofreader was trained based on the [FredT5-large](https://huggingface.co/ai-forever/FRED-T5-large) model.
18
- An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE] library (https://github.com /orgs/ai-forever/sage).
19
 
20
- ### Articles and speeches
21
- - [Speech about the SAGE library](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
22
- - [Article about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
23
- - [Article about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
 
24
 
25
  ### Examples
26
  *Examples are given with default generation parameters
@@ -106,7 +107,7 @@ print(answer)
106
  ```
107
 
108
  ## Resources
109
- - [SAGE library code with augmentation methods, access to datasets and open models](https://github.com/orgs/ai-forever/sage), GitHub
110
  - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
111
  - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
112
  - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace
@@ -114,7 +115,7 @@ print(answer)
114
 
115
  ## License
116
  Model [FRED-T5-large](https://huggingface.co/ai-forever/FRED-T5-large), on the basis of which our solution is made, and its source code are supplied under the APACHE-2.0 license.
117
- Our solution also comes with an MIT license.
118
 
119
  ## Specifications
120
  - File size: 3.5 Gb;
@@ -124,4 +125,4 @@ Our solution also comes with an MIT license.
124
  - Developer: SberDevices, AGI NLP
125
 
126
  ## Contacts
127
- For questions related to the operation and application of the model, please contact the product manager: Pavel Lebedev PIgLebedev@sberbank.ru.
 
15
  ### Summary
16
  The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
17
  The proofreader was trained based on the [FredT5-large](https://huggingface.co/ai-forever/FRED-T5-large) model.
18
+ An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE librar](https://github.com/orgs/ai-forever/sage).
19
 
20
+ ### Public references
21
+ - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
22
+ - [Paper about synthetic error generation methods](https://www.dialog-21.ru/media/5914/martynovnplusetal056.pdf), Dialogue 2023
23
+ - [Paper about SAGE and our best solution](https://arxiv.org/abs/2308.09435), Review EACL 2024
24
+ - [Path_to_model](https://huggingface.co/ai-forever/FRED-T5-large-spell)
25
 
26
  ### Examples
27
  *Examples are given with default generation parameters
 
107
  ```
108
 
109
  ## Resources
110
+ - [SAGE library](https://github.com/orgs/ai-forever/sage), GitHub
111
  - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
112
  - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
113
  - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace
 
115
 
116
  ## License
117
  Model [FRED-T5-large](https://huggingface.co/ai-forever/FRED-T5-large), on the basis of which our solution is made, and its source code are supplied under the APACHE-2.0 license.
118
+ Our solution also comes with MIT license.
119
 
120
  ## Specifications
121
  - File size: 3.5 Gb;
 
125
  - Developer: SberDevices, AGI NLP
126
 
127
  ## Contacts
128
+ nikita.martynov.98@list.ru