Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
benediktstroebl/hal
agent-evals
/
core_leaderboard
like
0
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
e23eddc
core_leaderboard
3 contributors
History:
76 commits
benediktstroebl
Delete evals_live/swebench_verified_Agentless_gpt-4o-2024-07-18_50_Instances_1723916965.json
e23eddc
verified
7 months ago
agent_monitor
added timestamp to task summary prompt for failure report and fixed failure report gradio issue
7 months ago
evals_live
Delete evals_live/swebench_verified_Agentless_gpt-4o-2024-07-18_50_Instances_1723916965.json
7 months ago
evals_processed
init files to keep dirs open
7 months ago
evals_upload
init files to keep dirs open
7 months ago
utils
added failure report and two new swebench variants
7 months ago
.gitattributes
Safe
1.99 kB
update
7 months ago
.gitignore
Safe
104 Bytes
Update .gitignore
7 months ago
README copy.md
Safe
14.7 kB
init
7 months ago
README.md
Safe
236 Bytes
initial commit
7 months ago
about.md
Safe
36 Bytes
update
7 months ago
app.py
Safe
28.1 kB
added timestamp to task summary prompt for failure report and fixed failure report gradio issue
7 months ago
config.py
Safe
722 Bytes
layout update
7 months ago
css.css
Safe
2.54 kB
init
7 months ago
envs.py
Safe
191 Bytes
added auto update
7 months ago
requirements.txt
Safe
1.85 kB
update
7 months ago