roberta-poetry-life-crpo

This model is the RoBERTa base model (125M parameters) fine-tuned for 20 epochs on a 14 MB poetry dataset. The dataset was extracted from the Gutenberg Poetry Corpus using an automatic classifier that selects poems related to the topic of life and death.

The model replaces a masked word, indicated by the <mask> tag, with a word associated with life and death, while preserving fluency. Caution: the topic (here, life and death) only biases the choice of words relative to the base model; do not expect every suggestion to be strongly associated with this topic.
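The masked-word replacement described above can be tried with the `fill-mask` pipeline from the `transformers` library. The following is a minimal sketch, not the authors' own code: the helper names `load_fill_mask` and `top_candidates` are hypothetical, and because this card does not state the full repository id, the script takes it as a command-line argument and otherwise falls back to the untuned `roberta-base` as a stand-in.

```python
import sys
from typing import Callable, List, Tuple

MASK_TAG = "<mask>"  # mask token used by RoBERTa tokenizers


def load_fill_mask(model_id: str) -> Callable:
    """Build a fill-mask pipeline (hypothetical helper; needs `transformers`)."""
    from transformers import pipeline  # imported lazily so the helpers below stay lightweight

    return pipeline("fill-mask", model=model_id)


def top_candidates(fill_mask: Callable, text: str, k: int = 5) -> List[Tuple[str, float]]:
    """Return up to k (word, score) pairs proposed for the single <mask> in `text`."""
    if text.count(MASK_TAG) != 1:
        raise ValueError("text must contain exactly one <mask> tag")
    predictions = fill_mask(text, top_k=k)
    # Each prediction is a dict; `token_str` may carry a leading space.
    return [(p["token_str"].strip(), float(p["score"])) for p in predictions]


if __name__ == "__main__":
    # Assumption: pass the full repository id of this model as the first argument.
    # Without it, the untuned base model is used, which shows no topic bias.
    model_id = sys.argv[1] if len(sys.argv) > 1 else "roberta-base"
    unmasker = load_fill_mask(model_id)
    for word, score in top_candidates(unmasker, "The <mask> comes softly in the night."):
        print(f"{word}\t{score:.3f}")
```

Running the script once with `roberta-base` and once with this model's repository id gives a rough view of how the fine-tuning shifts the ranking of candidate words for the same line.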

This model was trained by Teo Ferrari as part of his Bachelor's thesis at HEIG-VD, supervised by Andrei Popescu-Belis. The model is described in "GPoeT: a Language Model Trained for Rhyme Generation on Synthetic Data" and is used in the CR-PO system for interactive poem generation, along with several other models for specific topics or emotions.
