1 6 53

Yiming Zheng

ZYM666

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

liked a dataset about 2 months ago

GAIR/LIMO

liked a dataset about 2 months ago

simplescaling/s1K

View all activity

Organizations

None yet

ZYM666's activity

upvoted a paper about 1 month ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 120

liked 2 datasets about 2 months ago

GAIR/LIMO

Viewer • Updated Feb 10 • 817 • 4.25k • 149

simplescaling/s1K

Viewer • Updated Feb 11 • 1k • 2.36k • 212

liked 3 datasets 2 months ago

liked 2 models 3 months ago

peiyi9979/math-shepherd-mistral-7b-prm

Text Generation • Updated Jan 15, 2024 • 3.04k • 47

deepseek-ai/DeepSeek-R1

Text Generation • Updated 27 days ago • 1.76M • • 12k

liked a dataset 3 months ago

HuggingFaceH4/prm800k-trl-dedup

Viewer • Updated Jan 9 • 379k • 120 • 2

liked a model 3 months ago

microsoft/phi-4

Text Generation • Updated Feb 24 • 424k • • 2k

liked a dataset 3 months ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 347 • 96

liked a model 3 months ago

google/gemma-2-9b

Text Generation • Updated Aug 7, 2024 • 118k • 654

liked a Space 3 months ago

548

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

authored a paper 4 months ago

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework

Paper • 2412.11713 • Published Dec 16, 2024 • 5

upvoted a paper 4 months ago

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework

Paper • 2412.11713 • Published Dec 16, 2024 • 5

updated a Space 4 months ago

ECS

🚀

liked 2 datasets 4 months ago

THUDM/LongBench-v2

Viewer • Updated Dec 20, 2024 • 503 • 5.81k • 14

THUDM/LongBench

Viewer • Updated Dec 18, 2024 • 8.42k • 56.5k • 141

liked a model 5 months ago

deepseek-ai/deepseek-math-7b-instruct

Text Generation • Updated Feb 6, 2024 • 51.1k • 123

liked a dataset 5 months ago

hotpotqa/hotpot_qa

Updated Jan 18, 2024 • 10.5k • 120