Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 67
Thinking LLMs: General Instruction Following with Thought Generation Paper • 2410.10630 • Published Oct 14, 2024 • 18
Democratizing Reasoning Ability: Tailored Learning from Large Language Model Paper • 2310.13332 • Published Oct 20, 2023 • 14
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning Paper • 2412.16849 • Published 21 days ago • 8
SRA-MCTS: Self-driven Reasoning Augmentation with Monte Carlo Tree Search for Code Generation Paper • 2411.11053 • Published Nov 17, 2024 • 3
Enhancing LLM Agents for Code Generation with Possibility and Pass-rate Prioritized Experience Replay Paper • 2410.12236 • Published Oct 16, 2024 • 1
Efficiently Serving LLM Reasoning Programs with Certaindex Paper • 2412.20993 • Published 12 days ago • 32