LLM as a Judge Collection Curated resources that support the use of LLMs to serve as automatic evaluators of other LLM outputs. • 15 items • Updated 27 days ago • 16
Code Evaluation Collection Collection of Papers on Code Evaluation (from code generation language models) • 30 items • Updated 30 days ago • 7