Spaces:

Duplicated from benediktstroebl/hal

agent-evals
/

core_leaderboard

Running

App Files Files Community

core_leaderboard / utils

3 contributors

History: 10 commits

benediktstroebl's picture

benediktstroebl

updated width of plot

be40ce5 7 months ago

data.py

9.47 kB

format update and added monitor llm client backend 7 months ago
pareto.py

1.34 kB

big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard 7 months ago
processing.py

5.97 kB

update to avoid automatic processing 7 months ago
viz.py

8.58 kB

updated width of plot 7 months ago