8 3 6

Jaesun Park

jaesun

jaesuny

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

HyperCLOVA X Technical Report

authored a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

liked a Space about 2 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

jaesun's activity

authored 2 papers about 2 months ago

HyperCLOVA X Technical Report

Paper • 2404.01954 • Published Apr 2, 2024 • 24

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

liked a Space about 2 months ago

2.5k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 6 months ago

stas/ml-engineering-book

Updated 24 days ago • 16

upvoted a paper 8 months ago

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126

liked a model about 1 year ago

xai-org/grok-1

Text Generation • Updated Mar 28, 2024 • 298 • 2.3k

upvoted 2 papers about 1 year ago

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 143

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 50

liked a dataset about 2 years ago

bigcode/the-stack-dedup

Viewer • Updated Aug 17, 2023 • 237M • 7.24k • 351

liked a model almost 3 years ago

bigscience/bloom

Text Generation • Updated Jul 28, 2023 • 3.93k • 4.89k

liked a Space almost 3 years ago

5.57k

DALL·E mini

🥑