view article Article Replicating DeepSeek R1 for Information Extraction By Ihor • 3 days ago • 19
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 21 days ago • 89
kenhktsui/open-react-retrieval-multi-neg-result-new-kw Viewer • Updated Aug 7, 2023 • 25.2k • 45 • 3
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models Paper • 2402.10524 • Published Feb 16, 2024 • 23 • 6