Tristan Thrush's picture

Tristan Thrush

Tristan

·

http://www.tristanthrush.com/

AI & ML interests

NLP, Datasets, Multimodality

Recent Activity

upvoted an article about 1 month ago

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

updated a model about 2 months ago

Tristan/dclm-perplexity-correlations-410m-3

updated a model about 2 months ago

Tristan/dclm-perplexity-correlations-160m-3

View all activity

Organizations

Tristan's activity

upvoted an article about 1 month ago

Article

Optimizing Pretraining Data Mixes with LLM-Estimated Utility

By

•

Jan 22

• 3

updated 6 models about 2 months ago

Tristan/dclm-perplexity-correlations-410m-3

Text Generation • Updated Jan 13 • 17

Tristan/dclm-perplexity-correlations-160m-3

Text Generation • Updated Jan 13 • 13

Tristan/dclm-random-410m

Text Generation • Updated Jan 13 • 13

Tristan/dclm-random-160m

Text Generation • Updated Jan 13 • 12

Tristan/dclm-multilingual-410m

Text Generation • Updated Jan 13 • 12

Tristan/dclm-multilingual-160m

Text Generation • Updated Jan 13 • 18

updated 13 models 3 months ago

Tristan/dclm-perplextiy-correlations-410m-2

Text Generation • Updated Dec 12, 2024 • 50

Tristan/dclm-perplextiy-correlations-160m-2

Text Generation • Updated Dec 12, 2024 • 50

Tristan/dclm-perplexity-correlations-spearmanr-no-samp-410m

Text Generation • Updated Nov 22, 2024 • 173

Tristan/dclm-perplexity-correlations-spearmanr-no-samp-160m

Text Generation • Updated Nov 22, 2024 • 171

Tristan/dclm-perplexity-correlations-spearmanr-410m

Text Generation • Updated Nov 22, 2024 • 169

Tristan/dclm-perplexity-correlations-spearmanr-160m

Text Generation • Updated Nov 22, 2024 • 167

Tristan/dclm-perplexity-correlations-1b

Text Generation • Updated Nov 22, 2024 • 137

Tristan/dclm-perplexity-correlations-410m

Text Generation • Updated Nov 21, 2024 • 57

Tristan/dclm-perplexity-correlations-160m

Text Generation • Updated Nov 21, 2024 • 68

Tristan/dclm-perplexity-correlations-160m-smol

Text Generation • Updated Nov 20, 2024 • 64

Tristan/dclm-perplexity-correlations-160m-target-to-be-bad

Text Generation • Updated Nov 19, 2024 • 62

Tristan/dclm-fasttext-oh-eli5-1b

Text Generation • Updated Nov 19, 2024 • 61

Tristan/dclm-uniform-1b

Text Generation • Updated Nov 18, 2024 • 72