Function Calling Evaluation bench Nexus (0-shot)

#41
by WateBear - opened

How can I evaluate other models on Nexus (0-shot)? Where can I find the blog post or paper for this benchmark?
