
Minho Ryu

bzantium


AI & ML interests

None yet




bzantium's activity

authored a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

upvoted a collection about 2 months ago

Kanana Nano 2.1B

Collection

Open Source SLM • 8 items • Updated Feb 27 • 14

upvoted a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

commented on a paper about 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

liked 3 models about 2 months ago

updated a model 2 months ago

bzantium/deepseek-v3-test

Updated Feb 15 • 2

published a model 2 months ago

bzantium/deepseek-v3-test

Updated Feb 15 • 2

updated a model 3 months ago

bzantium/tiny-deepseek-v3

Updated Jan 29 • 2.53k

published a model 3 months ago

bzantium/tiny-deepseek-v3

Updated Jan 29 • 2.53k

updated a dataset 3 months ago

bzantium/MMMLU

Viewer • Updated Jan 19 • 393k • 106

published a dataset 3 months ago

bzantium/MMMLU

Viewer • Updated Jan 19 • 393k • 106

upvoted a paper 5 months ago

Upcycling Large Language Models into Mixture of Experts

Paper • 2410.07524 • Published Oct 10, 2024 • 4

New activity in EleutherAI/polyglot-ko-5.8b 6 months ago

#1 opened 6 months ago by

hyerong

upvoted a paper 7 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179

upvoted a collection 9 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 663

upvoted a paper 10 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 96

liked a Space 11 months ago

FineWeb: decanting the web for the finest text data at scale 🍷

Generate high-quality web text data for LLM training • 924


Model copyright inquiry [copyrights]
