Can existing large datasets be used to fine tune the blip'large_caption task?

#29

by shams123321 - opened Mar 21

Mar 21

Can existing large datasets be used to fine tune the blip'large_caption task?
The dataset used is UFine6926 dataset, as its text description of images is very fine-grained, with an average of 80.8 words per image. Can this dataset be used to fine tune the caption task of blip?
Looking forward to your reply!
Thank you!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment