Writing in the Margins: Better Inference Pattern for Long Context Retrieval • Paper • 2408.14906 • Published 27 days ago • 137
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts • Paper • 2401.04081 • Published Jan 8 • 70