wangpeiyi

peiyi9979

AI & ML interests

None yet

peiyi9979's activity

upvoted a paper 13 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 385

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 27 days ago • 1.76M • 12k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 27 days ago • 5.59k • 901

upvoted a paper 7 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 73

liked a model 12 months ago

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8, 2024 • 130k • 316

upvoted 2 papers about 1 year ago

An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Paper • 2403.06764 • Published Mar 11, 2024 • 29

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 116

updated 3 models over 1 year ago

New activity in peiyi9979/math-shepherd-mistral-7b-prm over 1 year ago

Why does the config show this is a LLaMA model?

#1 opened over 1 year ago by tongyx361

liked a model over 1 year ago

deepseek-ai/deepseek-moe-16b-chat

Text Generation • Updated Feb 5, 2024 • 29.9k • 136

upvoted a paper over 1 year ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 47

liked a dataset over 1 year ago

MMInstruction/VLFeedback

Viewer • Updated Oct 17, 2024 • 80.3k • 325 • 46

updated a dataset over 1 year ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 347 • 96

liked a dataset over 1 year ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 347 • 96

New activity in peiyi9979/Math-Shepherd over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by parquet-converter

authored a paper almost 2 years ago

M³IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

Paper • 2306.04387 • Published Jun 7, 2023 • 8

liked a dataset almost 2 years ago

MMInstruction/M3IT

Updated Nov 24, 2023 • 18.9k • 124

New activity in bigcode/the-stack-dedup almost 2 years ago

bigcode/the-stack-dedup have no 2.7TB?

#14 opened almost 2 years ago by peiyi9979