ScreenAI: A Vision-Language Model for UI and Infographics Understanding Paper • 2402.04615 • Published Feb 7 • 33
InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions Paper • 2401.13313 • Published Jan 24 • 5