Transformers
PyTorch
English
reward model
RLHF
RLAIF
Inference Endpoints
banghua's picture
Update README.md
abf765b verified