Ding's picture

Ding

dyyyyyyyy

·

AI & ML interests

None yet

Recent Activity

new activity 16 days ago

dyyyyyyyy/FAPO-Critic:Add task categories, tags, paper link, and sample usage

new activity 16 days ago

dyyyyyyyy/FAPO-GenRM-4B:Improve model card: Add pipeline tag, library name, paper link, and abstract

authored a paper 17 days ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

View all activity

Organizations

New activity in dyyyyyyyy/FAPO-Critic 16 days ago

Add task categories, tags, paper link, and sample usage

#1 opened 16 days ago by

New activity in dyyyyyyyy/FAPO-GenRM-4B 16 days ago

Improve model card: Add pipeline tag, library name, paper link, and abstract

#1 opened 16 days ago by

commented a paper 17 days ago

FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning

Paper • 2510.22543 • Published 21 days ago • 6 •

commented a paper about 2 months ago

SCAN: Self-Denoising Monte Carlo Annotation for Robust Process Reward Learning

Paper • 2509.16548 • Published Sep 20 •

New activity in dyyyyyyyy/Qwen2.5-1.5B-GenRM-QueryOnly 5 months ago

Possible issue with the new tokenizer config chat template

#1 opened 5 months ago by

New activity in dyyyyyyyy/ScaleQuest-Qwen2-Math-7B-QGen about 1 year ago

Update README.md

#1 opened about 1 year ago by

New activity in dyyyyyyyy/ScaleQuest-DeepSeekMath-7B-QGen about 1 year ago

Update README.md

#1 opened about 1 year ago by

New activity in dyyyyyyyy/DeepSeekMath-7B-ScaleQuest about 1 year ago

Update README.md

#1 opened about 1 year ago by

New activity in dyyyyyyyy/Llama3-8B-ScaleQuest about 1 year ago

add Demo usage

#1 opened about 1 year ago by

New activity in dyyyyyyyy/Mistral-7B-ScaleQuest about 1 year ago

add Demo usage

#1 opened about 1 year ago by

New activity in dyyyyyyyy/Qwen2-Math-7B-ScaleQuest about 1 year ago

add Demo usage

#1 opened about 1 year ago by

commented a paper about 1 year ago

Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch

Paper • 2410.18693 • Published Oct 24, 2024 • 42 •