API endpoint
$ curl -X POST \
-H "Authorization: Bearer YOUR_ORG_OR_USER_API_TOKEN" \
-H "Content-Type: application/json" \
-d '"json encoded string"' \
https://api-inference.huggingface.co/models/lvwerra/gpt2-imdb-ctrl

pytorch

tf

Contributed by

lvwerra Leandro von Werra
5 models

How to use this model directly from the 🤗/transformers library:

			
from transformers import AutoTokenizer, AutoModelWithLMHead

tokenizer = AutoTokenizer.from_pretrained("lvwerra/gpt2-imdb-ctrl")
model = AutoModelWithLMHead.from_pretrained("lvwerra/gpt2-imdb-ctrl")

GPT2-IMDB-ctrl

What is it?

A small GPT2 (lvwerra/gpt2-imdb) language model fine-tuned to produce controlled movie reviews based on the IMDB dataset. The model is trained with rewards from a BERT sentiment classifier (lvwerra/bert-imdb) via PPO.
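The card does not say how the classifier output is turned into a reward for each target sentiment. The sketch below shows one plausible shaping, assuming the classifier yields a single logit where larger values mean more positive sentiment. The function name `shaped_reward` and the neutral formula (including the offset 4.0) are illustrative assumptions, not the confirmed training setup.

```python
def shaped_reward(positive_logit: float, control: str) -> float:
    """Map a sentiment logit to a reward that depends on the control string.

    Hypothetical shaping for illustration only:
      [positive] rewards a high logit,
      [negative] rewards a low logit,
      [neutral]  rewards a logit close to zero.
    """
    if control == "[positive]":
        return positive_logit
    if control == "[negative]":
        return -positive_logit
    if control == "[neutral]":
        # Assumed form: peaks at 4.0 when the logit is 0, falls off linearly.
        return 4.0 - 2.0 * abs(positive_logit)
    raise ValueError(f"unknown control string: {control!r}")
```

Under this assumed shaping, a strongly negative review scores well for "[negative]" and poorly for "[positive]", while "[neutral]" only scores well when the classifier is undecided.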

Training setting

The model was trained for 200 optimisation steps with a batch size of 256, which corresponds to 51,200 training samples. The full experiment setup can be found in the Jupyter notebook in the trl repo. One of the strings "[negative]", "[neutral]", or "[positive]" is prepended to the query to control the sentiment of the response.
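A minimal sketch of how a control string might be combined with a query at inference time. The helper names `build_query` and `generate_review` are hypothetical, and joining the control string to the text without a separator is an assumption; the notebook in the trl repo is the reference for the exact formatting. The sketch uses `AutoModelForCausalLM` in place of the older `AutoModelWithLMHead` class.

```python
CONTROL_STRINGS = ("[negative]", "[neutral]", "[positive]")


def build_query(sentiment: str, text: str) -> str:
    """Prepend the control string for `sentiment` to the query text."""
    control = f"[{sentiment}]"
    if control not in CONTROL_STRINGS:
        raise ValueError(f"sentiment must be negative, neutral or positive, got {sentiment!r}")
    # Assumption: no separator between the control string and the text.
    return control + text


def generate_review(sentiment: str, text: str, max_new_tokens: int = 20) -> str:
    """Sample a continuation of `text` steered towards `sentiment`."""
    # Imported lazily so build_query works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("lvwerra/gpt2-imdb-ctrl")
    model = AutoModelForCausalLM.from_pretrained("lvwerra/gpt2-imdb-ctrl")
    inputs = tokenizer(build_query(sentiment, text), return_tensors="pt")
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        pad_token_id=tokenizer.eos_token_id,  # GPT2 has no dedicated pad token
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_review("positive", "I watched this movie when")` downloads the model on first use and returns a sampled review, so the text differs from run to run.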

Examples

A few examples of the model's responses to a query under each of the three control strings, with the corresponding rewards:

| query | response [negative] | rewards [negative] | response [neutral] | rewards [neutral] | response [positive] | rewards [positive] |
|---|---|---|---|---|---|---|
| I watched this movie when | it was released and was awful. Little bit of ... | 3.130034 | it was released and it was the first movie I ... | -1.351991 | I was younger it was wonderful. The new play ... | 4.232218 |
| I can remember seeing this | movie in 2008, and I was so disappointed...yo... | 3.428725 | in support groups, which I think was not as i... | 0.213288 | movie, and it is one of my favorite movies ev... | 4.168838 |
| This 1970 hit film has | little resonance. This movie is bad, not only... | 4.241872 | a bit of Rocket power.783287. It can be easil... | 0.849278 | the best formula for comedy and is't just jus... | 4.208804 |