Pengxiang Li's picture

Pengxiang Li

pengxiang

·

pixeli99

AI & ML interests

Video generation, Image editing, AD

Recent Activity

upvoted a paper 4 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

upvoted a paper 10 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

updated a model 11 days ago

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop

View all activity

Organizations

None yet

Collections 1

Papers 8

arxiv:2504.14239

arxiv:2502.05795

arxiv:2501.04575

arxiv:2412.13795

models 10

pengxiang/Qwen2.5-1.5B-Open-R1-Distill-loop

Updated 11 days ago • 4

pengxiang/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • Updated 12 days ago • 8

pengxiang/Qwen2.5-1.5B-Open-R1-GRPO

Updated 14 days ago

pengxiang/LNS_1B

Updated Mar 2 • 5 • 1

pengxiang/TrackDiffusion_SVD_Stage2

Text-to-Video • Updated Jan 9

pengxiang/TrackDiffusion_SVD_Stage1

Text-to-Video • Updated Jan 9

pengxiang/TrackDiffusion_Pretrain

Updated Apr 22, 2024 • 3 • 1

pengxiang/GLIGEN_1_4

Updated Apr 10, 2024 • 3

pengxiang/TrackDiffusion_ModelScope

Text-to-Video • Updated Apr 8, 2024

pengxiang/trackdiffusion_ytvis

Text-to-Video • Updated Mar 29, 2024 • 2

datasets 16

pengxiang/coins_new

Viewer • Updated 14 days ago • 4.91k • 384

pengxiang/COIN

Viewer • Updated 15 days ago • 528 • 13

pengxiang/tvqa

Preview • Updated 22 days ago • 83

pengxiang/COINs

Viewer • Updated 25 days ago • 1.59k • 632

pengxiang/sthv2

Updated 25 days ago • 42

pengxiang/youcook2

Updated 27 days ago • 130

pengxiang/UVO

Viewer • Updated 27 days ago • 799 • 148

pengxiang/youcook

Viewer • Updated 27 days ago • 407 • 147

pengxiang/clevrer

Viewer • Updated Apr 4 • 10k • 37

pengxiang/oops

Updated Apr 4 • 27