Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
cb163b3
core_leaderboard
/
utils
3 contributors
History:
10 commits
benediktstroebl
updated width of plot
be40ce5
7 months ago
data.py
Safe
9.47 kB
format update and added monitor llm client backend
7 months ago
pareto.py
Safe
1.34 kB
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard
7 months ago
processing.py
Safe
5.97 kB
update to avoid automatic processing
7 months ago
viz.py
Safe
8.58 kB
updated width of plot
7 months ago