15 Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots · 8 authors 4
14 MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels · 31 authors 1