Text Classification
Transformers
PyTorch
English
deberta-v2
reward-model
reward_model
RLHF
Inference Endpoints

How to fine-tune this model with the Trainer API?

#8
by duzm - opened

When I fine-tune this model, I get the following error:
RuntimeError: The size of tensor a (2) must match the size of tensor b (8) at non-singleton dimension 1
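For readers hitting the same message: it is PyTorch's broadcasting error, raised when two tensors in an elementwise operation have sizes at the same dimension that differ and neither is 1. The post does not show which tensors or shapes were involved, so the sketch below is only an illustration in plain Python. The helper `broadcast_shapes` and the shapes `(8, 2)` and `(8, 8)` are assumptions chosen to reproduce the wording of the message, not the actual values from this training run.

```python
def broadcast_shapes(shape_a, shape_b):
    """Hypothetical helper: apply the broadcasting rule to two shapes.

    Returns the broadcast shape, or raises RuntimeError with a message
    worded like the one in the post when the shapes are incompatible.
    """
    ndim = max(len(shape_a), len(shape_b))
    # Left-pad the shorter shape with 1s so both shapes have the same rank.
    a = (1,) * (ndim - len(shape_a)) + tuple(shape_a)
    b = (1,) * (ndim - len(shape_b)) + tuple(shape_b)

    result = []
    for dim, (x, y) in enumerate(zip(a, b)):
        if x == y or y == 1:
            result.append(x)
        elif x == 1:
            result.append(y)
        else:
            # Sizes differ and neither is 1: the shapes cannot be broadcast.
            raise RuntimeError(
                f"The size of tensor a ({x}) must match the size of "
                f"tensor b ({y}) at non-singleton dimension {dim}"
            )
    return tuple(result)


# Assumed shapes, for illustration only: 2 columns versus 8 columns.
try:
    broadcast_shapes((8, 2), (8, 8))
except RuntimeError as err:
    print(err)
```

If the error comes from the loss computation, printing the shapes of the model's logits and of the labels in one batch is a reasonable first check, since a mismatch between the two would produce a message of this form.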
