DRAMA Collection A collection of small (sub-1B) multilingual dense retrievers that generalize well across a number of tasks and languages. β’ 3 items β’ Updated 2 days ago β’ 2
Granite Guardian Models Collection A collection of models created by IBM for safeguarding language models. β’ 13 items β’ Updated 4 days ago β’ 16
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org β’ 3 items β’ Updated 2 days ago β’ 47
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper β’ 2502.14786 β’ Published 8 days ago β’ 118
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior Paper β’ 2310.16818 β’ Published Oct 25, 2023 β’ 32
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling Paper β’ 2501.17811 β’ Published about 1 month ago β’ 6
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published 12 days ago β’ 134
SkyReels-V1 Collection SkyReels V1 open models collections β’ 2 items β’ Updated 11 days ago β’ 17
Llasa Collection TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) β’ 11 items β’ Updated 8 days ago β’ 7
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS β’ 3 items β’ Updated 11 days ago β’ 28
OLMoE (January 2025) Collection Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app β’ 10 items β’ Updated 17 days ago β’ 9
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) β’ 8 items β’ Updated 12 days ago β’ 52