Collection of Papers on Evaluating Code from Code Generation Language Models
- A Survey on Language Models for Code
  Paper • 2311.07989
- CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation
  Paper • 2102.04664
- Evaluating Large Language Models Trained on Code
  Paper • 2107.03374
- Out of the BLEU: how should we assess quality of the Code Generation models?
  Paper • 2208.03133