M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper β’ 2411.06176 β’ Published Nov 9, 2024 β’ 44
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions Paper β’ 2405.20267 β’ Published May 30, 2024 β’ 1
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions Paper β’ 2405.20267 β’ Published May 30, 2024 β’ 1