weqweasdas ZennyKenny commited on
Commit
81b58a2
1 Parent(s): e8adab3

Fix dataset link (#1)

Browse files

- Fix dataset link (4222ba004d13090c57a395c83f2aa062545d8fe6)


Co-authored-by: Kenneth Hamilton <ZennyKenny@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -18,7 +18,7 @@ If you have any question with this reward model and also any question about rewa
18
 
19
  <!-- Provide a longer summary of what this model is. -->
20
 
21
- The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](weqweasdas/preference_dataset_mixture2_and_safe_pku).
22
  - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
23
  - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
24
  - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)
 
18
 
19
  <!-- Provide a longer summary of what this model is. -->
20
 
21
+ The model is trained on a mixture of the following datasets. We also provide the mixture in [weqweasdas/preference_dataset_mixture2_and_safe_pku](https://huggingface.co/datasets/weqweasdas/preference_dataset_mixture2_and_safe_pku).
22
  - [HH-RLHF](https://huggingface.co/datasets/Anthropic/hh-rlhf)
23
  - [SHP](https://huggingface.co/datasets/stanfordnlp/SHP)
24
  - [UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)