DataComp

non-profit

https://www.datacomp.ai/dclm/index.html#home

dclm's activity

AmeyaPrabhu

authored 7 papers 1 day ago

A Practitioner's Guide to Continual Multimodal Pretraining

Paper • 2408.14471 • Published Aug 26, 2024

CiteME: Can Language Models Accurately Cite Scientific Claims?

Paper • 2407.12861 • Published Jul 10, 2024

Open Problems in Machine Unlearning for AI Safety

Paper • 2501.04952 • Published Jan 9 • 2

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 63

Corrective Machine Unlearning

Paper • 2402.14015 • Published Feb 21, 2024

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published 2 days ago • 16

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published 2 days ago • 14

pengyuan

authored a paper 1 day ago

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

Paper • 2502.09927 • Published 15 days ago

ranpox

authored a paper 8 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 9 days ago • 150

wannaphong

authored a paper 9 days ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published 10 days ago • 13

Lewis-Lau

authored a paper 9 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 99

bencw

authored a paper 15 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 15 days ago • 32

AmeyaPrabhu

authored a paper 21 days ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published 22 days ago • 30

thomwolf

authored a paper 22 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 195

weizechen

authored a paper 24 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 25 days ago • 54

lx865712528

authored a paper about 1 month ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published Jan 28 • 36

Wanfq

authored 2 papers about 1 month ago

BlockPruner: Fine-grained Pruning for Large Language Models

Paper • 2406.10594 • Published Jun 15, 2024

ProFuser: Progressive Fusion of Large Language Models

Paper • 2408.04998 • Published Aug 9, 2024

lx865712528

authored a paper about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

yentinglin

authored a paper about 1 month ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15