Instruction-tuned model for a range of vision-language tasks
Generate stunning high quality illusion artwork
Generate images from text descriptions
Generate optimized prompts for Stable Diffusion
Describe images using multiple models