Transformers
PyTorch
English
llama
reward model
RLHF
RLAIF
text-generation-inference
banghua's picture davide221's picture
Missing import for inference (#5)
1a1d0e4 verified