DPO Jon
Collection
DPO For Jon
•
45 items
•
Updated
This model is a fine-tuned version of EleutherAI/pythia-1b on the None dataset.
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Base model
EleutherAI/pythia-1b