omarmomen committed on
Commit
8da597c
1 Parent(s): 07ea877

Update README.md

  1. README.md +22 -1
README.md CHANGED
@@ -5,4 +5,25 @@ language:
5
  tags:
6
  - text simplification
7
  - german
8
- ---
5
  tags:
6
  - text simplification
7
  - german
8
+ datasets:
9
+ - DEplain/DEplain-APA
10
+ metrics:
11
+ - sari
12
+ - bleu
13
+ - bertscore
14
+ library_name: transformers
15
+ pipeline_tag: text2text-generation
16
+ ---
17
+
18
+ # DEplain German Text Simplification
19
+
20
+ This model is part of the experiments reported in Stodden, Momen, and Kallmeyer (2023), ["DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification."](https://arxiv.org/abs/2305.18939) In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Toronto, Canada. Association for Computational Linguistics.
21
+ Detailed documentation is available in the GitHub repository [https://github.com/rstodden/DEPlain](https://github.com/rstodden/DEPlain).
22
+
23
+ ### Model Description
24
+
25
+ The model is a finetuned checkpoint of the pre-trained LongmBART model, which is based on `mbart-large-cc25`, with its vocabulary trimmed to the 30k most frequent words in German.
26
+
27
+ The model was finetuned for document-level German text simplification.
28
+
29
+ The finetuning data consisted only of manually aligned sentences from the `DEplain-APA-doc` dataset.
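
The vocabulary trimming mentioned in the model description can be illustrated with a minimal sketch. This is not the authors' procedure, only a generic frequency-based approach under stated assumptions: the function names `trim_vocabulary` and `encode`, the special tokens, and the toy corpus are all hypothetical, and a real setup would count subword tokens over a large German corpus and keep 30k entries.

```python
from collections import Counter


def trim_vocabulary(tokenized_corpus, keep, special_tokens=()):
    """Map special tokens plus the `keep` most frequent tokens to new ids.

    Hypothetical helper for illustration; not taken from the DEplain code.
    """
    counts = Counter(tok for sentence in tokenized_corpus for tok in sentence)
    vocab = {}
    # Special tokens always survive trimming and get the lowest ids.
    for tok in special_tokens:
        vocab.setdefault(tok, len(vocab))
    limit = len(vocab) + keep
    # Add regular tokens from most to least frequent until the budget is used.
    for tok, _ in counts.most_common():
        if len(vocab) >= limit:
            break
        vocab.setdefault(tok, len(vocab))
    return vocab


def encode(tokens, vocab, unk_token="<unk>"):
    """Replace each token by its id, falling back to the unknown-token id."""
    unk_id = vocab[unk_token]
    return [vocab.get(tok, unk_id) for tok in tokens]


# Toy corpus (assumption): "das" occurs 3 times, "ist" twice, "ein" once.
corpus = [["das", "ist", "ein"], ["das", "ist"], ["das"]]
trimmed = trim_vocabulary(corpus, keep=2, special_tokens=("<s>", "</s>", "<unk>"))
ids = encode(["das", "ein"], trimmed)
```

With this toy input, `ein` falls outside the two-token budget and is mapped to the `<unk>` id. In a real model the embedding matrix would also need to be reduced to the rows of the kept tokens, which this sketch does not cover.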