arxiv:2411.00836
Xingang Guo
optizer
AI & ML interests
LLMs, ML, RL, optimization
Recent Activity
authored
a paper
about 1 month ago
COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
authored
a paper
about 1 month ago
Capabilities of Large Language Models in Control Engineering: A
Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra
authored
a paper
about 1 month ago
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical
Reasoning Robustness of Vision Language Models