Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18 • 52
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models Paper • 2406.13035 • Published Jun 18 • 3
A Survey on Model Compression for Large Language Models Paper • 2308.07633 • Published Aug 15, 2023 • 3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression Paper • 2403.07378 • Published Mar 12 • 3