What does this PR do?

This PR adds the LM detoxification example to this space!

Link to docs: https://huggingface.co/docs/trl/main/en/detoxifying_a_lm

Under the hood, we used RL (Reinforcement Learning) to teach the model not to generate toxic content when given toxic prompts!
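
For readers new to the idea, here is a minimal sketch of the reward signal such a setup relies on. It is not the code from this PR: `TOXIC_WORDS`, `toxicity_score`, and `reward` are hypothetical names, and the word list is a toy stand-in for a trained toxicity classifier. The actual RL update step is omitted.

```python
# Toy illustration only: a real setup would score text with a trained
# toxicity classifier and use the reward in an RL update of the model.

# Hypothetical word list standing in for a toxicity classifier.
TOXIC_WORDS = {"idiot", "stupid", "hate"}


def toxicity_score(text: str) -> float:
    """Return the fraction of words in `text` that are on the toy toxic list."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    if not words:
        return 0.0
    return sum(w in TOXIC_WORDS for w in words) / len(words)


def reward(continuation: str) -> float:
    """Give a higher reward to less toxic continuations (range -1.0 to 0.0)."""
    return -toxicity_score(continuation)


# Two candidate continuations for the same (toxic) prompt.
candidates = [
    "You are an idiot and I hate you",
    "I disagree, but let us talk it through",
]

# The RL objective pushes the model toward the higher-reward continuation.
best = max(candidates, key=reward)
```

The point of the sketch is only the direction of the signal: the less toxic the continuation, the higher the reward the model is trained to seek.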

