Notebooks
This directory contains a collection of Jupyter notebooks that demonstrate how to use the TRL library in different applications.
best_of_n.ipynb
: This notebook demonstrates how to use the "Best of N" sampling strategy with TRL when fine-tuning your model with PPO.
gpt2-sentiment.ipynb
: This notebook demonstrates how to reproduce the GPT2 IMDB sentiment tuning example in a Jupyter notebook.
gpt2-sentiment-control.ipynb
: This notebook demonstrates how to reproduce the GPT2 sentiment control example in a Jupyter notebook.
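For readers unfamiliar with the "Best of N" idea, here is a minimal sketch in plain Python. It does not use TRL's API and is not taken from the notebook; `best_of_n`, `toy_generate`, and `toy_reward` are hypothetical names chosen for illustration. The idea, as commonly described, is to sample several candidate completions for a prompt, score each with a reward function, and keep the highest-scoring one.

```python
import random
from typing import Callable, List


def best_of_n(prompt: str,
              generate: Callable[[str], str],
              reward: Callable[[str], float],
              n: int = 4) -> str:
    """Sample n candidates for the prompt and return the one with the highest reward."""
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=reward)


# Toy stand-ins (assumptions): in practice these would be a language model
# and a reward model such as a sentiment classifier.
_rng = random.Random(0)


def toy_generate(prompt: str) -> str:
    # Append a randomly chosen ending to mimic stochastic sampling.
    endings = [" was terrible.", " was fine.", " was great!"]
    return prompt + _rng.choice(endings)


def toy_reward(text: str) -> float:
    # Crude sentiment score: +1 per "great", -1 per "terrible".
    return float(text.count("great")) - float(text.count("terrible"))


best = best_of_n("The movie", toy_generate, toy_reward, n=8)
print(best)
```

Larger values of `n` raise the chance of finding a high-reward sample at the cost of more generation calls; the notebook itself shows how this is done with real models.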