Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 790
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 28 days ago • 40
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated 3 days ago • 14
Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 151
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published Dec 23, 2024 • 30
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 93
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 301 items • Updated 2 days ago • 37