Yu Zhang

yzhangcs

https://yzhang.site

yzhangcs's activity

liked a dataset about 1 month ago

yaofu/slimpajama-per-source-length-upsample

Viewer • Updated Feb 15, 2024 • 84.7k • 152 • 18

upvoted a paper about 1 month ago

DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ

Paper • 2405.15306 • Published May 24, 2024 • 7

updated a collection about 1 month ago

🔥 flame

A collection of baselines trained by 🔥 flame • 2 items • Updated Mar 18

updated a model about 1 month ago

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-decay0.1-sqrt

Updated Mar 14 • 1

published a model about 1 month ago

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-decay0.1-sqrt

Updated Mar 14 • 1

updated a collection about 2 months ago

🔥 flame

A collection of baselines trained by 🔥 flame • 2 items • Updated Mar 18

updated a model about 2 months ago

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-cosine

Updated Mar 14 • 6 • 1

published a model about 2 months ago

fla-hub/transformer-340M-4K-0.5B-20480-lr3e-4-cosine

Updated Mar 14 • 6 • 1

upvoted a paper 2 months ago

LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid

Paper • 2502.07563 • Published Feb 11 • 24

liked a dataset 2 months ago

Congliu/Chinese-DeepSeek-R1-Distill-data-110k

Viewer • Updated Feb 21 • 110k • 2.54k • 647

upvoted a collection 2 months ago

Deepseek Papers

Deepseek papers collection • 19 items • Updated 24 days ago • 191

New activity in fla-hub/rwkv7-2.9B-world 2 months ago

bfloat16 safetensors as required by Peng Bo

#1 opened 2 months ago by

updated a collection 2 months ago

Qwen2.5

6 items • Updated Mar 18

updated a model 2 months ago

fla-hub/transformer-3B-qwen2.5

Updated Feb 13 • 1

published a model 2 months ago

fla-hub/transformer-3B-qwen2.5

Updated Feb 13 • 1

updated a model 2 months ago

fla-hub/transformer-3B-qwen2.5-instruct

published a model 2 months ago

fla-hub/transformer-3B-qwen2.5-instruct

updated a model 2 months ago

fla-hub/transformer-1.5B-qwen2.5-instruct

updated a collection 2 months ago

Qwen2.5

6 items • Updated Mar 18