How can I evaluate other models on Nexus (0-shot)? And where can I find the blog post or paper for this benchmark?