LLM as a Judge Collection Curated resources that support the use of LLMs to serve as automatic evaluators of other LLM outputs. • 15 items • Updated May 16 • 19
Code Evaluation Collection Collection of Papers on Code Evaluation (from code generation language models) • 36 items • Updated about 24 hours ago • 9