Falcon-7B, fine-tuned for detoxification with the PPO algorithm and a reward model on super-toxic prompts.
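The card does not describe the training setup, so the following is only a minimal sketch of the clipped surrogate objective that PPO generally maximizes, not this model's actual training code. The function name `ppo_clipped_objective`, the `clip_eps` value of 0.2, and the idea that the advantages are derived from reward-model scores (for example, higher for less toxic completions) are all assumptions made for illustration.

```python
import math


def ppo_clipped_objective(logp_new, logp_old, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch of samples.

    logp_new / logp_old: log-probabilities of the sampled tokens under the
    current policy and under the policy that generated the samples.
    advantages: per-sample advantage estimates (assumed here to come from
    reward-model scores; the model card does not specify this).
    clip_eps: clipping range, a hypothetical default of 0.2.
    """
    total = 0.0
    for ln, lo, adv in zip(logp_new, logp_old, advantages):
        # Probability ratio between the new and old policy.
        ratio = math.exp(ln - lo)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # The new policy doubles the probability of a sample with positive advantage;
    # clipping limits how much credit that update receives.
    print(ppo_clipped_objective([math.log(2.0)], [0.0], [1.0]))
```

The clipping keeps each policy update close to the policy that generated the samples, which is the usual reason PPO is chosen for reward-model-driven fine-tuning of language models.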