Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
agent-evals
/
leaderboard
like
0
Running
App
Files
Files
Community
main
leaderboard
1 contributor
History:
9 commits
benediktstroebl
hide swebench lite and mlagentbench
512799d
21 days ago
agent_monitor
minor tweaks
21 days ago
utils
minor tweaks
21 days ago
.gitattributes
Safe
1.58 kB
Upload preprocessed_traces.db
24 days ago
.gitignore
Safe
74 Bytes
init v1
24 days ago
README.md
Safe
236 Bytes
init v1
24 days ago
about.md
Safe
5.39 kB
init v1
24 days ago
agent_performance_analysis.json
Safe
5.08 kB
init v1
24 days ago
agent_submission.md
Safe
766 Bytes
init v1
24 days ago
agent_submission_core.md
Safe
2.77 kB
init v1
24 days ago
app.py
Safe
82.2 kB
hide swebench lite and mlagentbench
21 days ago
benchmark_submission.md
Safe
496 Bytes
init v1
24 days ago
config.py
Safe
2.07 kB
init v1
24 days ago
css.css
Safe
936 Bytes
init v1
24 days ago
envs.py
Safe
191 Bytes
init v1
24 days ago
hal.ico
Safe
15.4 kB
init v1
24 days ago
hal.png
Safe
1.03 kB
init v1
24 days ago
header.md
Safe
118 Bytes
init v1
24 days ago
preprocessed_traces.db
Safe
1.95 GB
LFS
Upload preprocessed_traces.db
22 days ago
requirements.txt
Safe
1.84 kB
init v1
24 days ago
scratch.ipynb
0 Bytes
init v1
24 days ago
scratch.py
Safe
1.61 kB
init v1
24 days ago
verified_agents.yaml
Safe
3.94 kB
minor tweaks
21 days ago