Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
galileo-ai
/
agent-leaderboard
like
172
Running
on
CPU Upgrade
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
b0ce6f5
agent-leaderboard
/
output
/
claude-3-5-sonnet-20241022
5 contributors
History:
1 commit
Pratik Bhavsar
added data exploration
b0ce6f5
7 days ago
BFCL_v3_irrelevance.parquet
Safe
47.4 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_base_multi_func_call.parquet
Safe
25.9 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_base_single_func_call.parquet
Safe
25.5 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_composite.parquet
Safe
51.4 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_long_context.parquet
Safe
41 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_miss_func.parquet
Safe
51.2 kB
added data exploration
7 days ago
BFCL_v3_multi_turn_miss_param.parquet
Safe
51.5 kB
added data exploration
7 days ago
tau_long_context.parquet
Safe
48.3 kB
added data exploration
7 days ago
toolace_single_func_call_1.parquet
Safe
20.4 kB
added data exploration
7 days ago
toolace_single_func_call_2.parquet
Safe
13.9 kB
added data exploration
7 days ago
xlam_multiple_tool_multiple_call.parquet
Safe
91.5 kB
added data exploration
7 days ago
xlam_multiple_tool_single_call.parquet
Safe
42.4 kB
added data exploration
7 days ago
xlam_single_tool_multiple_call.parquet
Safe
29 kB
added data exploration
7 days ago
xlam_single_tool_single_call.parquet
Safe
48.3 kB
added data exploration
7 days ago
xlam_tool_miss.parquet
Safe
53.2 kB
added data exploration
7 days ago