If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024 • 4
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024 • 4 • 2
A PhD Student's Perspective on Research in NLP in the Era of Very Large Language Models Paper • 2305.12544 • Published May 21, 2023 • 1
Discriminator-Guided Multi-step Reasoning with Language Models Paper • 2305.14934 • Published May 24, 2023 • 1
A Distributional Approach to Controlled Text Generation Paper • 2012.11635 • Published Dec 21, 2020
Source-Aware Training Enables Knowledge Attribution in Language Models Paper • 2404.01019 • Published Apr 1, 2024 • 1
Small Language Models Need Strong Verifiers to Self-Correct Reasoning Paper • 2404.17140 • Published Apr 26, 2024
Learning to Reason via Program Generation, Emulation, and Search Paper • 2405.16337 • Published May 25, 2024
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs Paper • 2410.02899 • Published Oct 3, 2024
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs Paper • 2412.04144 • Published Dec 5, 2024 • 4
On Leakage of Code Generation Evaluation Datasets Paper • 2407.07565 • Published Jul 10, 2024 • 5