Dongfu Jiang's picture

Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

updated a model about 3 hours ago

CodeDPO/AceCoder-Qwen2.5-Coder-1.5B-Ins-RM

published a model about 3 hours ago

CodeDPO/AceCoder-Qwen2.5-Coder-1.5B-Ins-RM

updated a model about 22 hours ago

CodeDPO/qwen25-coder-1.5b-inst-reinforce-plus_new_dataset_hard_r1

View all activity

Organizations

Papers 11

arxiv:2502.01718

arxiv:2410.10563

arxiv:2406.15252

arxiv:2406.11069

models 40

DongfuJiang/math_ct_adapt_qwen2.5_1.5B

Updated 26 days ago • 7

DongfuJiang/math_ct_qwen2.5_1.5B

Updated 26 days ago • 6

DongfuJiang/Qwen2-VL-VAE-7B-Instruct

Image-Text-to-Text • Updated Dec 17, 2024 • 8

DongfuJiang/Qwen2-VL-VAE-7B-Instruct-mochi-vae

Text2Text Generation • Updated Dec 17, 2024 • 7

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_pt

Text Generation • Updated Dec 9, 2024 • 11

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_12_sft

Text Generation • Updated Dec 9, 2024 • 5

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_pt

Text Generation • Updated Dec 9, 2024 • 6

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_and_end_2_6_sft

Text Generation • Updated Dec 9, 2024 • 10

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_pt

Text Generation • Updated Dec 7, 2024 • 7

DongfuJiang/qwen2_chunking_mlp_freeze_uniform_with_shared_start_sft

Updated Dec 7, 2024

datasets 12

DongfuJiang/Big-Math-RL-Verified-CT

Viewer • Updated 25 days ago • 17.8k • 917

DongfuJiang/PRM_SFT

Viewer • Updated Dec 1, 2024 • 4.01M • 7

DongfuJiang/zeroeval

Viewer • Updated Nov 27, 2024 • 13.5k • 84

DongfuJiang/PRM_eval

Viewer • Updated Nov 27, 2024 • 9.54k • 4

DongfuJiang/eval

Viewer • Updated Nov 27, 2024 • 6k • 62

DongfuJiang/PRM_prepared

Viewer • Updated Nov 26, 2024 • 39.9k • 8

DongfuJiang/PRM_train

Viewer • Updated Nov 25, 2024 • 32.7k • 5

DongfuJiang/MATH-500

Viewer • Updated Nov 6, 2024 • 500 • 47

DongfuJiang/simpo_v2_ultrafeedback

Viewer • Updated Aug 2, 2024 • 59.9k • 47

DongfuJiang/VAPO

Viewer • Updated Jul 31, 2024 • 72.5k • 10