ldwang
ldwang
AI & ML interests
LLM, MLLM, Infra
Recent Activity
updated
a dataset
3 days ago
BAAI/OpenSeek-Pretrain-Data-Examples
published
a dataset
3 days ago
BAAI/OpenSeek-Pretrain-Data-Examples
Organizations
Collections
4
-
526
Scaling test-time compute
πEnhance math problem solving by scaling test-time compute
-
809
FineWeb: decanting the web for the finest text data at scale
π·Generate high-quality web text data for LLM training
-
1.78k
The Ultra-Scale Playbook
πThe ultimate guide to training LLM on large GPU Clusters