OpenAssistant/oasst-rm-2-pythia-6.9b-epoch-1

gpt_neox_reward_model

Inference Endpoints

The output of the reward model is a two-dimensional vector. What does each dimension mean?

#3 opened 6 months ago by

More details on training data for reward model

#2 opened 8 months ago by

Where is the input file for augment_oasst?

#1 opened 9 months ago by