Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
core_leaderboard
3 contributors
History:
150 commits
Zachary Siegel
updatae db
eadf8af
3 months ago
agent_monitor
Big update with SQL backend
6 months ago
evals_live
fix typo in agent name
3 months ago
evals_processed
init files to keep dirs open
6 months ago
evals_upload
init files to keep dirs open
6 months ago
utils
add results to leaderboard
5 months ago
.gitattributes
Safe
2.05 kB
Upload preprocessed_traces.db
6 months ago
.gitignore
Safe
115 Bytes
update corebench results
3 months ago
README copy.md
Safe
14.7 kB
init
7 months ago
README.md
Safe
236 Bytes
initial commit
7 months ago
about.md
Safe
5.39 kB
Upload 3 files
6 months ago
agent_submission.md
Safe
2.76 kB
submit to any of the three levels
5 months ago
app.py
Safe
18.7 kB
update title
5 months ago
benchmark_submission.md
Safe
496 Bytes
Upload 3 files
6 months ago
config.py
Safe
1.37 kB
added first agent to leaderboard
5 months ago
css.css
Safe
997 Bytes
vis update
6 months ago
envs.py
Safe
191 Bytes
added auto update
7 months ago
hal.ico
Safe
15.4 kB
Upload 5 files
6 months ago
hal.png
Safe
1.03 kB
Upload 5 files
6 months ago
header.md
Safe
118 Bytes
vis update
6 months ago
preprocessed_traces.db
Safe
128 MB
LFS
updatae db
3 months ago
requirements.txt
Safe
1.86 kB
Upload requirements.txt
6 months ago
scratch.py
Safe
1.61 kB
vis update
6 months ago
verified_agents.yaml
Safe
1.3 kB
verify o1 mini
3 months ago