Spaces:
Running
Running
model,agentbench | |
gpt-4-0613,4.01 | |
claude-2,2.49 | |
claude-v1.3,2.44 | |
gpt-3.5-turbo-0613,2.32 | |
text-davinci-003,1.71 | |
claude-instant-v1.1,1.60 | |
chat-bison-001,1.39 | |
text-davinci-002,1.25 | |
llama-2-70b-chat,0.78 | |
guanaco-65b,0.54 | |
codellama-34b-instruct,0.96 | |
vicuna-33b-v1.3,0.73 | |
wizardlm-30b-v1.0,0.46 | |
guanaco-33b,0.39 | |
vicuna-13b-v1.5,0.93 | |
llama-2-13b-chat,0.77 | |
openchat-13b-v3.2,0.70 | |
wizardlm-13b-v1.2,0.66 | |
vicuna-7b-v1.5,0.56 | |
codellama-13b-instruct,0.56 | |
codellama-7b-instruct,0.50 | |
koala-13b,0.34 | |
llama-2-7b-chat,0.34 | |
codegeex2-6b,0.27 | |
dolly-12b-v2,0.14 | |
chatglm-6b-v1.1,0.11 | |
oasst-12b-sft-4,0.03 |