ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3 Text Generation • Updated Dec 20, 2024 • 622 • 14
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 21 days ago • 89
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 59
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 26 days ago • 252
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 25 days ago • 87
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • Jan 3 • 32