MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published 2 days ago • 72
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning Paper • 2406.12050 • Published Jun 17, 2024 • 19
SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark Paper • 2402.05138 • Published Feb 6, 2024 • 2
MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning Paper • 2307.07951 • Published Jul 16, 2023
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation Paper • 2305.14386 • Published May 22, 2023
What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks Paper • 2305.18365 • Published May 27, 2023 • 4