weqweasdas commited on
Commit
47d0c20
1 Parent(s): 5519e53

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -18,8 +18,7 @@ If you have any question with this reward model and also any question about rewa
18
 
19
  <!-- Provide a longer summary of what this model is. -->
20
 
21
- The model is trained on a mixture of the dataset similar to [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).
22
-
23
  - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
24
  - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
25
  - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)
 
18
 
19
  <!-- Provide a longer summary of what this model is. -->
20
 
21
+ The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](weqweasdas/preference_dataset_mixture2_and_safe_pku).
 
22
  - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
23
  - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
24
  - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)