Thank you very much for this model, I have questions

by NickyNicky - opened 9 days ago

9 days ago

I would like to know how the model was fine-tuned.

Did you use the GRPO implementation from the Hugging Face TRL library?

Could you share which libraries were used for the training?

Thank you so much.

Open Thoughts org 9 days ago

Mykes

about 10 hours ago

Thank you for your model! Did you use only SFT, or other methods as well (such as DPO, KTO, or PPO)?

Open Thoughts org about 5 hours ago

We only used SFT for this model.
