---
title: Verbal Reasoning Challenge
emoji: 🤔
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.15.0
app_file: app.py
pinned: false
license: bsd-3-clause
---

# PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

This application presents the results of several models that we
evaluated on a verbal reasoning challenge
([Papers](https://huggingface.co/papers/2502.01584),
[arXiv](https://arxiv.org/abs/2502.01584)).
The overall results are below. Use the tabs above to explore the results in more detail.