EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Paper • 2401.15077 • Published Jan 26, 2024 • 18
Fast Inference of Mixture-of-Experts Language Models with Offloading Paper • 2312.17238 • Published Dec 28, 2023 • 7