Update README.md
README.md
CHANGED
@@ -48,7 +48,7 @@ We trained PairRM on a diverse collection of six human-preference datasets:
 - [`chatbot_arena_conversations`](https://huggingface.co/datasets/lmsys/chatbot_arena_conversations)
 - [`webgpt_comparisons`](https://huggingface.co/datasets/openai/webgpt_comparisons)
 - [`instruct-synthetic-prompt-responses`](https://huggingface.co/datasets/Dahoas/instruct-synthetic-prompt-responses).
-
+
 PairRM is part of the LLM-Blender project (ACL 2023). Please see our [paper](https://arxiv.org/abs/2306.02561) above to know more.
 
 