Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 12 days ago • 34
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 12 days ago • 34
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding Paper • 2411.18462 • Published Nov 27, 2024 • 6
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 224