Rui Yang

Ray2333

https://yangrui2015.github.io

YangRui2015

AI & ML interests

Deep Reinforcement Learning

Recent Activity

liked a model 2 months ago

Ray2333/GRM_Llama3.1_8B_rewardmodel-ft

updated a model 2 months ago

Ray2333/GRM-Llama3-8B-rewardmodel-ft

updated a model 2 months ago

Ray2333/GRM-gemma2-2B-rewardmodel-ft

Ray2333's activity

New activity in Ray2333/GRM-Llama3.2-3B-rewardmodel-ft 3 months ago

Model Size

#1 opened 3 months ago by

commented a paper 3 months ago

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Paper • 2411.00836 • Published Oct 29, 2024

New activity in Ray2333/GRM-llama3-8B-sftreg 3 months ago

Adding `safetensors` variant of this model

#3 opened 3 months ago by

New activity in Ray2333/GRM-llama3-8B-sftreg 5 months ago

Abnormally Large Memory Footprint?

#2 opened 5 months ago by

Some weights of the model checkpoint at Ray2333/GRM-llama3-8B-sftreg were not used when initializing

#1 opened 5 months ago by

New activity in Ray2333/gpt2-large-harmless-reward_model 6 months ago

Load failed: There is no "pytorch_model.bin", how to load the model?

#3 opened 6 months ago by

New activity in Ray2333/gpt2-large-harmless-reward_model 7 months ago

a bug when loading model

#2 opened 7 months ago by

New activity in Ray2333/RiC_harmless_helpful 7 months ago

Librarian Bot: Add language metadata for dataset

#2 opened 7 months ago by

New activity in Ray2333/gpt2-large-harmless-reward_model 10 months ago

How to train the model

#1 opened 10 months ago by