omarmomen committed on
Commit
21f7260
1 Parent(s): b68231f

Create README.md

Files changed (1)
  1. README.md +25 -0
README.md ADDED
@@ -0,0 +1,25 @@
+ ---
+ datasets:
+ - DEplain/DEplain-APA
+ language:
+ - de
+ metrics:
+ - bleu
+ - sari
+ - bertscore
+ library_name: transformers
+ pipeline_tag: text2text-generation
+ ---
+
+ # DEplain German Text Simplification
+
+ This model is part of the experiments reported in Stodden, Momen, and Kallmeyer (2023): ["DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification."](https://arxiv.org/abs/2305.18939) In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Toronto, Canada. Association for Computational Linguistics.
+ Detailed documentation is available in the GitHub repository [https://github.com/rstodden/DEPlain](https://github.com/rstodden/DEPlain).
+
+ ### Model Description
+
+ The model is a fine-tuned checkpoint of the pre-trained mBART model `mbart-large-cc25`, with its vocabulary trimmed to the 30k most frequent words in German.
+
+ The model was fine-tuned for sentence-level German text simplification.
+
+ The fine-tuning data consisted solely of the manually aligned sentences from the `DEplain-APA-sent` dataset.
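
The vocabulary trimming mentioned in the model description can be illustrated with a small frequency-based selection. This is only a sketch under stated assumptions, not the procedure the authors used: the function name `top_k_vocabulary`, the whitespace tokenization, and the special tokens are hypothetical. mBART works on SentencePiece subword pieces, so a real trimming step would count those pieces instead and would also shrink the model's embedding matrix to match.

```python
from collections import Counter
from typing import Iterable, List, Sequence


def top_k_vocabulary(sentences: Iterable[str], k: int,
                     special_tokens: Sequence[str] = ()) -> List[str]:
    """Keep the special tokens plus the k most frequent tokens in the corpus.

    Assumption: tokens are whitespace-separated words. A subword model
    would count its own pieces here instead.
    """
    # Count how often each token occurs across all sentences.
    counts = Counter(tok for s in sentences for tok in s.split())

    # Special tokens are always kept and do not count towards k.
    vocab = list(dict.fromkeys(special_tokens))
    n_special = len(vocab)
    seen = set(vocab)

    # Walk tokens from most to least frequent until k have been added.
    for tok, _ in counts.most_common():
        if len(vocab) - n_special >= k:
            break
        if tok not in seen:
            vocab.append(tok)
            seen.add(tok)
    return vocab


# Toy corpus (hypothetical); the model card refers to k = 30_000.
corpus = ["das Haus ist groß", "das Haus ist", "das Haus", "das"]
trimmed = top_k_vocabulary(corpus, k=2, special_tokens=["<s>", "</s>"])
print(trimmed)
```

Tokens that fall outside the kept set would have to be mapped to an unknown token or re-segmented, which is one reason trimming is usually done on subword pieces and not on whole words.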