Xiaosen Zheng

xszheng2020

AI & ML interests

Data-Centric AI and AI Safety.

Recent Activity

liked a dataset 16 days ago
proj-persona/PersonaHub
liked a model 17 days ago
Qwen/Qwen2.5-3B-Instruct-AWQ
liked a model 17 days ago
Qwen/Qwen2.5-1.5B-Instruct-AWQ
View all activity

Organizations

xszheng2020's activity

upvoted an article about 1 month ago
view article
Article

SmolLM - blazingly fast and remarkably powerful

265
upvoted an article about 1 month ago
view article
Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

66
upvoted an article 4 months ago
view article
Article

How NuminaMath Won the 1st AIMO Progress Prize

104
upvoted an article 5 months ago
view article
Article

RegMix: Data Mixture as Regression for Language Model Pre-training

10