LLM - a meigel Collection

meigel 's Collections

LLM

LLM

updated 1 day ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 7 days ago • 93
ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 78
AceMath: Advancing Frontier Math Reasoning with Post-Training and Reward Modeling

Paper • 2412.15084 • Published Dec 19, 2024 • 13
The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 11 days ago • 85
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 16 days ago • 245
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 17 days ago • 37
System-2 Mathematical Reasoning via Enriched Instruction Tuning

Paper • 2412.16964 • Published Dec 22, 2024
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 15 days ago • 50
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 79
Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 4 days ago • 19
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement

Paper • 2501.12273 • Published 3 days ago • 14
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 7 days ago • 35
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 9 days ago • 268
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 13 days ago • 74