ai-progress-charts / simple_bench_leaderboard.jsonl
kaizuberbuehler's picture
Add data for ARC-AGI and Simple Bench
9ac5371
raw
history blame
466 Bytes
{"model": "claude-3-5-sonnet-20240620", "score": 27}
{"model": "gpt-4-1106-preview", "score": 26}
{"model": "claude-3-opus-20240229", "score": 25}
{"model": "llama-3.1-405b-instruct-fp8", "score": 22}
{"model": "gemini-1.5-pro-001", "score": 21}
{"model": "gpt-4-0613", "score": 18}
{"model": "gpt-4o-2024-05-13", "score": 16}
{"model": "deepseek-v2-api-0628", "score": 15}
{"model": "mistral-large-2407", "score": 13}
{"model": "gpt-4o-mini-2024-07-18", "score": 5}