codearena-rl / rewards_log.csv
havinashpatil
Finalizing CodeArena RL Benchmark: frontend improvements, GRPO training scripts, and cleaned environment
03a7eb9
raw
history blame contribute delete
644 Bytes
timestamp,task_id,step,reward,compile_score,test_ratio,efficiency_score
2026-04-25T11:18:35.777063,easy-1,5,0.01,0.0,0.0,0.0
2026-04-26T01:38:27.213698,easy-1,5,0.01,0.0,0.0,0.0
2026-04-26 01:51:22,easy-1,5,0.20000000000000004,0.0,0.0,0.0
2026-04-26 01:52:42,easy-1,5,0,0.0,0.0,0.0
2026-04-26 01:54:20,easy-1,5,0.6500000000000001,0.0,0.0,0.0
2026-04-26 01:55:07,easy-1,5,0.6500000000000001,0.0,0.0,0.0
2026-04-26 01:55:38,easy-1,5,0.6500000000000001,0.0,0.0,0.0
2026-04-26 01:56:11,easy-1,5,0.6500000000000001,0.0,0.0,0.0
2026-04-26 02:01:49,medium-1,5,0.6500000000000001,0.0,0.0,0.0
2026-04-26 02:02:35,hard-1,5,0.7500000000000001,0.0,0.0,0.0