Zachary Siegel commited on
Commit
9bb290b
·
1 Parent(s): 34abc58

update claude and o1 mini

Browse files
evals_live/corebench_hard_core-agent_o1-mini_20241116.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a3ef73cfe040f4e14066e51b8c9389ea8979568cd14c9855556322138e041df8
3
- size 1454
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:860b1cfaa88803d2b88ed6c6335840d5cf2f20e18a58ad7c691713ee83096101
3
+ size 1471
evals_live/{corebench_hard_core-agent_claude-3_5-sonnet_20241116.json → corebench_hard_core_agent_claude-3_5-sonnet_20241118.json} RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2aed8e6a9228b2ccfdfbfb31690d0a6892f274f2bca7981a097ad2985fc2dd0d
3
- size 1474
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f87cd39f5d66e046aef00aef02c9ccb0812644a585d9322385b95adca224e3b6
3
+ size 102581845