Fine-tuning on multi GPUs
#14
by
hayj
- opened
Is there a get-started example on how to fine-tune this model using torch and DDP? (which is not possible with sentence-transformers)
https://github.com/microsoft/unilm/tree/master/simlm
You can use this.