---
title: README
emoji: 🏢
colorFrom: purple
colorTo: blue
sdk: static
pinned: false
---

Welcome to the official repository for DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision-Language Models. This repository contains the code, resources, and documentation supporting our paper, which introduces DynaMath, a benchmark designed to rigorously evaluate the robustness of mathematical reasoning in vision-language models (VLMs).

For further details, including the benchmark leaderboard, please visit our [project website](https://dynamath.github.io) and read our [preprint paper](https://huan-zhang.com/DynaMath.pdf).

<div style="text-align: center;">
<img src="https://cdn-uploads.huggingface.co/production/uploads/64d45451c34a346181b130dd/vK6Z0E8Qz4xV3yAZlKxq1.png" alt="image/png">
</div>