Article Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 108
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 19 days ago • 29
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models Paper • 2108.08877 • Published Aug 19, 2021 • 2
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 70
GritLM Collection Generative Representational Instruction Tuning (GRIT) • 64 items • Updated Apr 17, 2024 • 8
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension Paper • 1910.13461 • Published Oct 29, 2019 • 3