Space: agent-evals/core_leaderboard
Duplicated from: benediktstroebl/hal
core_leaderboard at b56511a
3 contributors
History: 108 commits
Latest commit: benediktstroebl, "Upload requirements.txt" (b56511a, verified), 7 months ago
Name                    Size           Updated       Last commit
agent_monitor           -              7 months ago  Big update with SQL backend
evals_live              -              7 months ago  Upload swebench_verified_Agentless_gpt-4o-mini-2024-07-18_50_Instances_1723916965.json
evals_processed         -              7 months ago  init files to keep dirs open
evals_upload            -              7 months ago  init files to keep dirs open
utils                   -              7 months ago  Upload viz.py
.gitattributes          2.05 kB        7 months ago  Upload preprocessed_traces.db
.gitignore              139 Bytes      7 months ago  Update .gitignore
README copy.md          14.7 kB        7 months ago  init
README.md               236 Bytes      7 months ago  initial commit
about.md                7.17 kB        7 months ago  modified heading and added about tab text
app.py                  86.5 kB        7 months ago  Upload app.py
config.py               1.62 kB        7 months ago  Upload config.py
css.css                 997 Bytes      7 months ago  vis update
envs.py                 191 Bytes      7 months ago  added auto update
header.md               118 Bytes      7 months ago  vis update
preprocessed_traces.db  1.14 GB (LFS)  7 months ago  Upload preprocessed_traces.db
requirements.txt        1.86 kB        7 months ago  Upload requirements.txt
scratch.py              1.61 kB        7 months ago  vis update
verified_agents.yaml    1.26 kB        7 months ago  added verified agents management and column and fixed widths

Every entry with a listed size is marked Safe.