annadmitrieva committed on
Commit 8ea5d82
1 parent: 615aeaa

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -8,4 +8,6 @@ pipeline_tag: text-generation
 
  This is the Finnish GPT3 XL model (https://huggingface.co/TurkuNLP/gpt3-finnish-xl/) finetuned for simplification. Finetuning was done according to these instructions: https://github.com/spyysalo/instruction-finetune. The instruction for simplification is "Mukauta selkosuomeksi\n\n".
 
- The finetuning data can be obtained here: http://urn.fi/urn:nbn:fi:lb-2024011703.
+ Paper: [Towards Automatic Finnish Text Simplification](https://aclanthology.org/2024.determit-1.4.pdf) (Dmitrieva & Tiedemann, DeTermIt-WS 2024).
+
+ The finetuning data can be obtained here: http://urn.fi/urn:nbn:fi:lb-2024011703. If you wish to replicate the results, you can find the training, validation, and testing sentence pairs' ids in the "splits.zip" archive in this repository. The ids contain the following information: "{regular text id}\__{simple text id}__{sentence pair number}".
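
The id format described in the added README line can be unpacked with a few lines of Python. This is only a sketch under stated assumptions: the fields are taken to be separated by a plain double underscore (the backslash in `\__` is presumably a Markdown escape), neither text id is assumed to contain `__` itself, and the names `PairId` and `parse_pair_id` as well as the example id are hypothetical, not taken from the repository.

```python
from typing import NamedTuple


class PairId(NamedTuple):
    """The three fields of a sentence pair id (hypothetical helper type)."""
    regular_text_id: str
    simple_text_id: str
    sentence_pair_number: str  # kept as a string; cast with int() if numeric


def parse_pair_id(pair_id: str) -> PairId:
    """Split '{regular text id}__{simple text id}__{sentence pair number}'."""
    # Assumption: '__' appears only as the field separator.
    parts = pair_id.strip().split("__")
    if len(parts) != 3:
        raise ValueError(
            f"expected 3 '__'-separated fields, got {len(parts)}: {pair_id!r}"
        )
    return PairId(*parts)


# Made-up id for illustration only; real ids come from the files in splits.zip.
example = parse_pair_id("regA__simpleB__7")
print(example.regular_text_id, example.simple_text_id, example.sentence_pair_number)
```

If the text ids in "splits.zip" turn out to contain double underscores themselves, the strict three-field check will raise a `ValueError` rather than silently mis-split, which makes the assumption easy to verify on the real data.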