lenguist commited on
Commit
28c27b8
1 Parent(s): 3a73454

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - uk
4
+ library_name: transformers
5
+ pipeline_tag: text2text-generation
6
+ tags:
7
+ - grammatical error correction
8
+ - GEC
9
+ - Grammar
10
+ ---
11
+ This model was trained by Pravopysnyk team for the Ukrainian NLP shared task in Ukrainian grammar correction. The model is MBart-50-large
12
+ set to ukr-to-ukr translation task finetuned on UA-GEC augmented by custom dataset generated using our synthetic error generation.
13
+ The code for error generation will be uploaded on github soon, and the detailed procedure is described in our paper. For this model we added to ua-gec:-
14
+ 5k sentences generated by round translaion (ukr-rus-ukr)
15
+ 10k sentences using our punctuation error generation script
16
+ 2k of dilution (just fully correct sentences sampled from our dataset)
17
+ 10k of russism (errors generated using our russism error generation).
18
+
19
+ Hope you find this description helpful!