ai-forever commited on
Commit
f318028
1 Parent(s): e954845

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -15,7 +15,7 @@ tags:
15
  ### Summary
16
  The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
17
  The proofreader was trained based on the [M2M100-418M](https://huggingface.co/facebook/m2m100_418M) model.
18
- An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE library](https://github.com/orgs/ai-forever/sage).
19
 
20
  ### Public references
21
  - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
@@ -103,7 +103,7 @@ print(answer)
103
  ```
104
 
105
  ## Resources
106
- - [SAGE library](https://github.com/orgs/ai-forever/sage), GitHub
107
  - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
108
  - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
109
  - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace
 
15
  ### Summary
16
  The model corrects spelling errors and typos by bringing all the words in the text to the norm of the Russian language.
17
  The proofreader was trained based on the [M2M100-418M](https://huggingface.co/facebook/m2m100_418M) model.
18
+ An extensive dataset with “artificial” errors was taken as a training corpus: the corpus was assembled on the basis of the Russian-language Wikipedia and transcripts of Russian-language videos, then typos and spelling errors were automatically introduced into it using the functionality of the [SAGE library](https://github.com/ai-forever/sage).
19
 
20
  ### Public references
21
  - [SAGE library announcement](https://youtu.be/yFfkV0Qjuu0), DataFest 2023
 
103
  ```
104
 
105
  ## Resources
106
+ - [SAGE library](https://github.com/ai-forever/sage), GitHub
107
  - [ruM2M100-1.2B](https://huggingface.co/ai-forever/RuM2M100-1.2B), HuggingFace
108
  - [ruM2M100-418M](https://huggingface.co/ai-forever/RuM2M100-420M), HuggingFace
109
  - [FredT5-large-spell](https://huggingface.co/ai-forever/FRED-T5-large-spell), HuggingFace