Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published 11 days ago • 40
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents Paper • 2407.16741 • Published Jul 23, 2024 • 70
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers Paper • 2305.07185 • Published May 12, 2023 • 9