Merge branch 'main' of https://huggingface.co./spaces/agent-evals/leaderboard b585234 benediktstroebl committed on Aug 12, 2024
Upload usaco_USACO_Episodic_gpt-4o-mini-2024-07-18_1723429624.json 19f1cd0 unverified benediktstroebl committed on Aug 12, 2024
Delete evals_live/usaco_USACO_Episodic_gpt-4o-mini-2024-07-18_1723429624.json 4cf2b30 unverified benediktstroebl committed on Aug 12, 2024
Upload usaco_USACO_Semantic_gpt-4o-mini-2024-07-18_1723431631.json 7380536 unverified benediktstroebl committed on Aug 12, 2024
Delete evals_live/usaco_usaco_test_172306727812321123.json d3e9bdb unverified benediktstroebl committed on Aug 12, 2024
Delete evals_live/usaco_usaco_example_agent_1722871527.json 73428db unverified benediktstroebl committed on Aug 12, 2024
Delete evals_live/usaco_usaco_example_agent_1722871.json 317b884 unverified benediktstroebl committed on Aug 12, 2024
Upload usaco_USACO_Episodic_gpt-4o-mini-2024-07-18_1723429624.json 3ee1461 unverified benediktstroebl committed on Aug 12, 2024
Upload usaco_USACO_Zero-shot_gpt-4o-mini-2024-07-18_1723417375.json b0b576a unverified benediktstroebl committed on Aug 12, 2024
new data structure with global dict for faster processing f9140ad benediktstroebl committed on Aug 11, 2024
big update with raw predictions section and dropdowns that dynamically parse agents of current leaderboard ca89148 benediktstroebl committed on Aug 11, 2024
added initial version of visibility feature and fixed automatic update of results every hour 0b3117f benediktstroebl committed on Aug 9, 2024
Merge branch 'main' of https://huggingface.co./spaces/agent-evals/leaderboard b0d26e5 benediktstroebl committed on Aug 4, 2024