---
license: apache-2.0
---

This is an experimental model.

Trained on the dataset toxic-dpo-v0.1-NoWarning-alpaca, using the model Mistral-CatMacaroni-slerp-gradient.