Testing a reward model for RLHF on 1000 examples from [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf).
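One common way to test a reward model on preference data is pairwise accuracy: the fraction of pairs where the model scores the preferred response above the dispreferred one. The sketch below is an illustration under assumptions, not the evaluation actually used here. It relies on the `chosen` and `rejected` fields that hh-rlhf provides; `pairwise_accuracy` and `toy_score` are hypothetical names, and `toy_score` is a placeholder standing in for a real reward model.

```python
from typing import Callable, Iterable, Mapping


def pairwise_accuracy(
    pairs: Iterable[Mapping[str, str]],
    score_fn: Callable[[str], float],
) -> float:
    """Fraction of pairs where the chosen text gets a strictly higher score."""
    correct = 0
    total = 0
    for pair in pairs:
        total += 1
        # A pair counts as correct only if chosen strictly outscores rejected.
        if score_fn(pair["chosen"]) > score_fn(pair["rejected"]):
            correct += 1
    if total == 0:
        raise ValueError("pairs must contain at least one example")
    return correct / total


def toy_score(text: str) -> float:
    """Placeholder scorer (assumption): longer text gets a higher score."""
    return float(len(text))


# Toy data shaped like hh-rlhf rows; these strings are made up for illustration.
toy_pairs = [
    {"chosen": "A longer, more helpful answer.", "rejected": "No."},
    {"chosen": "Ok.", "rejected": "A long but unhelpful reply."},
]

accuracy = pairwise_accuracy(toy_pairs, toy_score)
print(f"pairwise accuracy: {accuracy:.2f}")
```

To run this on real data, `toy_score` would be replaced by a function that returns the reward model's scalar output for a full conversation, and `toy_pairs` by 1000 rows of the dataset. Which split and which rows were used is not stated here.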