ldwang · ftgreat

AI & ML interests

LLM, MLLM, Infra

ldwang's activity

upvoted an article 2 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 834

liked a dataset 2 days ago

dgslibisey/MuSiQue

Viewer • Updated Jun 16, 2023 • 22.4k • 1.07k • 6

commented on a paper 2 days ago

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Paper • 2503.18929 • Published 12 days ago • 3

upvoted a paper 2 days ago

Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training

Paper • 2503.18929 • Published 12 days ago • 3

upvoted a paper 4 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 8 days ago • 43

updated a collection 6 days ago

MiscR1

1 item • Updated 6 days ago

liked a model 9 days ago

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated Feb 23 • 61.2k • 529

New activity in BAAI/Infinity-MM 9 days ago

Some images in the dataset are labeled as jpg but are actually png

#13 opened 10 days ago by

liked a dataset 10 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 18 days ago • 15.2M • 11.6k • 315

liked a model 11 days ago

junnyu/DeepScaleR-1.5B-Preview-Reproduce

Text Generation • Updated Feb 26 • 68 • 3

liked a dataset 11 days ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10 • 40.3k • 4.34k • 90

upvoted 2 collections 17 days ago

OpenSeek

OpenSeek • 3 items • Updated Feb 25 • 2

Aquila

22 items • Updated 27 days ago • 4

updated a collection 17 days ago

MiscDatasets

5 items • Updated 17 days ago • 1

liked a dataset 17 days ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 418 • 92

liked a Space 22 days ago

Chat with DeepSeek-VL2-small

Generate responses using images and text input

liked a dataset 24 days ago

BAAI/CCI-Data

Updated Dec 17, 2024 • 74 • 68

upvoted a collection 27 days ago

SimpleRL

The collection for the Project "Simple Reinforcement Learning for Reasoning" • 2 items • Updated Feb 19 • 6

updated a collection 29 days ago

MiscModels

6 items • Updated 29 days ago • 1

liked a model 29 days ago

deepseek-ai/deepseek-vl2-tiny

Image-Text-to-Text • Updated Dec 18, 2024 • 87.4k • 173