Jaesun Park

jaesun

jaesuny

AI & ML interests

None yet

jaesun's activity

authored 2 papers 1 day ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 22

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published 3 days ago • 50

liked a Space 2 days ago

1.79k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 4 months ago

stas/ml-engineering-book

Updated Jan 22 • 16

upvoted a paper 6 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123

liked a model 12 months ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 1.11k • 2.27k

upvoted 2 papers about 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 140

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 49

liked a dataset about 2 years ago

bigcode/the-stack-dedup

Viewer • Updated Aug 17, 2023 • 237M • 5.36k • 343

liked a model over 2 years ago

bigscience/bloom

Text Generation • Updated Jul 28, 2023 • 1.16M • 4.86k

liked a Space almost 3 years ago

5.55k

DALL·E mini

🥑