Consistency-diversity-realism Pareto fronts of conditional image generative models Paper • 2406.10429 • Published Jun 14, 2024
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper • 2412.04626 • Published Dec 5, 2024 • 13
Improving Text-to-Image Consistency via Automatic Prompt Optimization Paper • 2403.17804 • Published Mar 26, 2024 • 17
Improving Text-to-Image Consistency via Automatic Prompt Optimization Paper • 2403.17804 • Published Mar 26, 2024 • 17
MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot Prompting Paper • 2210.07179 • Published Oct 13, 2022 • 3
Improving Automatic VQA Evaluation Using Large Language Models Paper • 2310.02567 • Published Oct 4, 2023 • 3
Measuring Progress in Fine-grained Vision-and-Language Understanding Paper • 2305.07558 • Published May 12, 2023 • 1