Fine tuning on text domain

by wilfoderek - opened Jul 5, 2023

Jul 5, 2023

Hi guys!
Please, share the receip to fine tuning in my own corpus this amazin model.
Thank you in advance!

Owner Jul 8, 2023

You can use any dense retrieval training framework, just replace the model initialization with this one.

Personally, I would recommend Tevatron and SimLM codebase.

Jul 12, 2023

Hi,
What codebase can I refer to if I want to implement the pre-training stage in your e5 paper?

Owner Jul 12, 2023

@Chuzhan The implementation for pre-training part is even simpler, you can use the same codebase by removing the hard negatives.

Geo

Aug 7, 2023

•

Is there a tutorial on how to fine tune this model on a Greek sentence similarity dataset?

Owner Aug 8, 2023

I am not aware of any, but you can adapt existing codebase for your specific needs.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment