SwiftKV Models Collection SwiftKV reduces prefill compute by up to 50% by combining model rewiring and knowledge-preserving self-distillation. • 4 items • Updated about 15 hours ago • 5
Arctic-embed Collection A collection of text embedding models optimized for retrieval accuracy and efficiency • 8 items • Updated Dec 5, 2024 • 17