May I ask how this differs from the 10ep version?

by bdsqlsz - opened Jul 16, 2024

Jul 16, 2024

Thank you for your great work.
SPO works very well, but I only just discovered this model. Is this the version trained for more epochs?
If possible, please add a LoRA version.

bdsqlsz

Jul 16, 2024

I rechecked GitHub, and it turns out that this is the preference model used for training.

bdsqlsz changed discussion status to closed Jul 16, 2024
