ai-progress-charts / arc_agi_leaderboard.jsonl
kaizuberbuehler's picture
Update data of ARC-AGI and Simple Bench; Add Codeforces and PlanBench
03738e4
raw
history blame contribute delete
263 Bytes
{"model": "o3", "score": 82.8}
{"model": "o1-preview-2024-09-12", "score": 21}
{"model": "claude-3-5-sonnet-20240620", "score": 21}
{"model": "o1-mini-2024-09-12", "score": 13}
{"model": "gpt-4o-2024-05-13", "score": 9}
{"model": "gemini-1.5-pro-001", "score": 8}