Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
hal
community
https://github.com/benediktstroebl/agent-eval-harness/tree/main
benediktstroebl
benediktstroebl
Activity Feed
Follow
5
AI & ML interests
None defined yet.
Recent Activity
benediktstroebl
updated
a dataset
about 1 month ago
agent-evals/agent_traces
benediktstroebl
updated
a dataset
about 1 month ago
agent-evals/agent_traces
benediktstroebl
authored
a paper
9 months ago
AI Agents That Matter
View all activity
Team members
4
spaces
2
Sort: Recently updated
Running
Agent Leaderboard
🏆
Display agent leaderboards for various benchmarks
Running
Agent Leaderboard
🏆
models
None public yet
datasets
2
Sort: Recently updated
agent-evals/agent_traces
Updated
Feb 27
•
462
agent-evals/results
Updated
Jan 16
•
9