Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes? Mar 5 • 4
mlfoundations-dev/airoboros_stage_3_none_resp_gpt-4o_inst_gpt-4o-mini_resp_test Viewer • Updated about 3 hours ago • 1.6k
mlfoundations-dev/airoboros_stage_2_none_resp_gpt-4o-mini Viewer • Updated about 12 hours ago • 21.4k