--- license: apache-2.0 datasets: - Anthropic/hh-rlhf language: - en pipeline_tag: text-generation --- The reference model after supervised fine-tuning on the chosen response.