OffsetBias: Leveraging Debiased Data for Tuning Evaluators Paper • 2407.06551 • Published Jul 9, 2024 • 1
Varco Arena: A Tournament Approach to Reference-Free Benchmarking Large Language Models Paper • 2411.01281 • Published Nov 2, 2024 • 6