Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains'
Yi Su
virtuoussy
Recent Activity
new activity
4 days ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR: GitHub address
new activity
6 days ago
virtuoussy/Qwen2.5-7B-Instruct-RLVR: How to use
new activity
12 days ago
virtuoussy/Multi-subject-RLVR:About subject information.