Can existing large datasets be used to fine tune the blip'large_caption task?

#29
by shams123321 - opened

Can existing large datasets be used to fine tune the blip'large_caption task?
The dataset used is UFine6926 dataset, as its text description of images is very fine-grained, with an average of 80.8 words per image. Can this dataset be used to fine tune the caption task of blip?
Looking forward to your reply!
Thank you!

Sign up or log in to comment