DynaMath Team

university

https://github.com/DynaMath

DynaMath

AI & ML interests

None defined yet.

DynaMath's activity

optizer

authored 3 papers about 2 months ago

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Paper • 2402.08679 • Published Feb 13, 2024 • 1

Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

Paper • 2404.03647 • Published Apr 4, 2024

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

huanzhang12

authored a paper 2 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

jyzhang1208

authored a paper 2 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

OwenZou

authored a paper 2 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

Ray2333

authored a paper 2 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024 • 15

Ray2333

updated a dataset 2 months ago

DynaMath/DynaMath_Sample

Viewer • Updated Nov 5, 2024 • 5.01k • 243 • 6

OwenZou

updated a dataset 2 months ago

DynaMath/DynaMath_Sample

Viewer • Updated Nov 5, 2024 • 5.01k • 243 • 6

Ray2333

updated a Space 2 months ago

Running

🏢

README

Ray2333

authored 2 papers 6 months ago

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

Paper • 2310.12955 • Published Oct 19, 2023 • 1

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Paper • 2402.10207 • Published Feb 15, 2024 • 2

Ray2333

authored a paper 7 months ago

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

Paper • 2406.10216 • Published Jun 14, 2024 • 2

Team members 5
