MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models • Paper • 2410.10139 • Published Oct 14, 2024 • 51
tsbpp/llava-vicuna-7b-diffusion-sd2_1-p16-res512-737k-bs512 • Text Generation • Updated Jul 31, 2024 • 8