Training time and resources

#1 by adendek

Dear authors,

Could you please share the resources (time and hardware) that were needed to train or finetune this model?

We trained on a single V100 GPU in Google Colab with a batch size of 100. On average, an epoch took about 2 seconds, and one sweep consisted of 10 epochs.

However, the model is small enough to fit on even an 8 GB GPU, so training would also work there, just significantly slower.
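For reference, here is a minimal PyTorch sketch of a run matching the numbers above (batch size 100, 10 epochs, GPU if available). This is not the authors' actual script: the model architecture, dataset, and learning rate are placeholders.

```python
import time

import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset

# Placeholder model and synthetic data; only the batch size (100) and
# epoch count (10) come from the discussion above.
model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 2))
dataset = TensorDataset(torch.randn(10_000, 32), torch.randint(0, 2, (10_000,)))
loader = DataLoader(dataset, batch_size=100, shuffle=True)

device = "cuda" if torch.cuda.is_available() else "cpu"  # V100 in the Colab run
model.to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # placeholder LR
loss_fn = nn.CrossEntropyLoss()

for epoch in range(10):  # 10 epochs per sweep, as described above
    start = time.time()
    for inputs, targets in loader:
        inputs, targets = inputs.to(device), targets.to(device)
        optimizer.zero_grad()
        loss = loss_fn(model(inputs), targets)
        loss.backward()
        optimizer.step()
    print(f"epoch {epoch}: {time.time() - start:.1f}s")  # ~2 s/epoch on a V100
```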
