CSDDSFSFSAFSAF/Reflect-R1

Reinforcement Learning

video-language-model

long-video-understanding

self-correction

33.2 GB

1 contributor

History: 4 commits

Add Reflect-R1 GRPO final checkpoint

e0395e5 verified 1 day ago