Full training code

#21
by vakker - opened

Hi,

It would be really useful to see the training code that was used.
I went through the fine-tuning samples, which are very useful, but I'm very curious about how the HF ecosystem can be utilized for training this model from scratch.

I would also be interested in at least some tips or clues. It would be wonderful to know how to choose which backbones you want for the vision and language model.

Sign up or log in to comment