OLMo 2 Preview Post-trained Models (Collection • 6 items): These models' tokenizers did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in the latest versions; a quick way to check which tokenizer a checkpoint loads is sketched after this list.
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (Paper • arXiv:2502.02737 • Published Feb 4, 2025)
Fine-tune ModernBERT for text classification using synthetic data (Article • by davidberenstein1957 • Dec 30, 2024)
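A minimal sketch of the tokenizer check mentioned in the first item, assuming the `transformers` library; the checkpoint id is an assumed example from the OLMo 2 collection, not taken from this page. It verifies whether the fast (Rust-backed) tokenizer is in use and, if so, inspects the pre-tokenization step where the described variations would surface.

```python
from transformers import AutoTokenizer

# Assumption: example OLMo 2 post-trained checkpoint id.
model_id = "allenai/OLMo-2-1124-7B-Instruct"

tok = AutoTokenizer.from_pretrained(model_id)
print(tok.is_fast)  # True once the checkpoint ships a fast tokenizer

# Fast tokenizers expose the underlying pre-tokenizer, so the exact
# pre-tokenization of an input can be inspected directly.
if tok.is_fast:
    pre_tok = tok.backend_tokenizer.pre_tokenizer
    if pre_tok is not None:
        # Returns a list of (fragment, (start, end)) pairs.
        print(pre_tok.pre_tokenize_str("Hello, OLMo 2!"))
```

If `is_fast` is False, the checkpoint fell back to the slow Python tokenizer, which is the situation the collection note says was fixed in later versions.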