q-future/Q-Instruct-DB
Preview
•
Updated
•
93
•
17
Collections of multimodal (image+text) instruction finetuning datasets tailored for visual language models like LlaVA, Fuyu, or IDEFICS.