license: apache-2.0 | |
datasets: | |
- Anthropic/hh-rlhf | |
language: | |
- en | |
pipeline_tag: text-generation | |
The reference model after supervised fine-tuning on the chosen response. |
license: apache-2.0 | |
datasets: | |
- Anthropic/hh-rlhf | |
language: | |
- en | |
pipeline_tag: text-generation | |
The reference model after supervised fine-tuning on the chosen response. |