The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Paper • 2501.09653 • Published Jan 16
NLBSE25 Code Comment Classification Competition Collection Dataset and baseline models for the NLBSE'25 Code Comment Classification competition • 4 items • Updated Oct 11, 2024
Traces of Memorisation in Large Language Models for Code Paper • 2312.11658 • Published Dec 18, 2023