🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 14 items • Updated 3 days ago • 102
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published Feb 11 • 51