Submitted by yilunzhao 11 Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers Yale NLP Lab 6 1