metadata
language:
- en
tags:
- vision-language
- clip
- vilt
datasets:
- lil-lab/kilogram-data
KiloGram dataset and code repo: https://github.com/lil-lab/kilogram
Preprocessed training and evaluation data: https://huggingface.co/datasets/lil-lab/kilogram-data
Citation
@misc{ji2022abstractvisualreasoningtangram,
title={Abstract Visual Reasoning with Tangram Shapes},
author={Anya Ji and Noriyuki Kojima and Noah Rush and Alane Suhr and Wai Keen Vong and Robert D. Hawkins and Yoav Artzi},
year={2022},
eprint={2211.16492},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2211.16492},
}