A while ago, I presented this Phi-2 DPO fine-tuning notebook with LoRA: https://colab.research.google.com/drive/1PGMj7jlkJaCiSNNihA2NtpILsRgkRXrJ#scrollTo=wXqoH2TMnjjp

Got some input from @ybelkada about not needing a ref_model, because we can just swap out the LoRA adapters during training. Cool feature 🤓
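For context, here's a minimal sketch of that pattern with TRL's DPOTrainer and PEFT: passing ref_model=None together with a LoRA config lets the trainer get reference logits by temporarily disabling the adapters, so no second copy of the model is kept. The model name, LoRA hyperparameters, target modules, and dataset below are illustrative assumptions rather than the notebook's exact settings, and argument names (e.g. processing_class vs. tokenizer) differ across TRL versions.

```python
# Sketch: DPO + LoRA without a separate reference model (assumed TRL/PEFT versions).
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "microsoft/phi-2"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Example LoRA config; target_modules are an assumption for Phi-2.
peft_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "dense"],
    task_type="CAUSAL_LM",
)

# Example preference dataset with "prompt"/"chosen"/"rejected" columns.
train_dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

trainer = DPOTrainer(
    model=model,
    # No ref_model: with a peft_config, the trainer computes reference logits
    # by disabling the LoRA adapters instead of holding a second model in memory.
    ref_model=None,
    args=DPOConfig(output_dir="phi2-dpo-lora", beta=0.1),
    train_dataset=train_dataset,
    processing_class=tokenizer,  # older TRL versions call this `tokenizer`
    peft_config=peft_config,
)
trainer.train()
```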