Luca Soldaini's picture

Luca Soldaini

soldni

·

https://soldaini.net

AI & ML interests

question answering, information retrieval, scientific document processing

Recent Activity

liked a model 1 day ago

allenai/olmOCR-7B-0225-preview

updated a dataset 4 days ago

allenai/olmOCR-mix-0225

published a dataset 4 days ago

allenai/olmOCR-mix-0225

View all activity

Organizations

soldni's activity

upvoted 5 collections 18 days ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 18 days ago • 64

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated 16 days ago • 91

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 18 days ago • 71

OLMoE (November 2024)

Artifacts for open mixture-of-experts language models. • 13 items • Updated 18 days ago • 29

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 17 days ago • 9

upvoted a collection 3 months ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 18 days ago • 83

upvoted a paper 3 months ago

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published Nov 21, 2024 • 30

upvoted a collection 5 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 18 days ago • 297

upvoted a collection 6 months ago

OLMo Suite

Artifacts for the first set of OLMo models. • 18 items • Updated 18 days ago • 71

upvoted 2 papers about 1 year ago

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 83

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

upvoted a collection about 1 year ago

Paloma

Dataset and baseline models for Paloma, a benchmark of language model fit to 546 textual domains • 8 items • Updated 18 days ago • 15