Spaces:
Running
Running
File size: 550 Bytes
41c62ec 89bcda9 0f62e55 41c62ec 89bcda9 41c62ec 89bcda9 a48f402 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
---
title: Verbal Reasoning Challenge
emoji: 🤔
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.15.0
app_file: app.py
pinned: false
license: bsd-3-clause
---
# PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models
This application presents the results of several models that we have
evaluated on a verbal reasoning challenge
([Papers](https://huggingface.co./papers/2502.01584),
[ArXiv](https://arxiv.org/abs/2502.01584)).
The overall results are below. Use the tabs above to explore the results in more detail.
|