llm-blender/PairRM

Dongfu Jiang committed on Jan 22

Commit 5b880cc (1 parent: d6bc040)

Update README.md

Files changed (1)

README.md +2 -2

@@ -3,7 +3,7 @@ license: mit
 datasets:
 - openai/summarize_from_feedback
 - openai/webgpt_comparisons
-- Dahoas/instruct-synthetic-prompt-responses
+- Dahoas/synthetic-instruct-gptj-pairwise
 - Anthropic/hh-rlhf
 - lmsys/chatbot_arena_conversations
 - openbmb/UltraFeedback
@@ -199,7 +199,7 @@ Learn more in our LLM-Blender Github [README.md](https://github.com/yuchenlin/LL
 ### Training Datasets
 - [openai/summarize_from_feedback](https://huggingface.co/datasets/openai/summarize_from_feedback)
 - [openai/webgpt_comparisons](https://huggingface.co/datasets/openai/webgpt_comparisons)
-- [Dahoas/instruct-synthetic-prompt-responses](https://huggingface.co/datasets/Dahoas/instruct-synthetic-prompt-responses)
+- [Dahoas/synthetic-instruct-gptj-pairwise](https://huggingface.co/datasets/Dahoas/synthetic-instruct-gptj-pairwise)
 - [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf)
 - [lmsys/chatbot_arena_conversations](https://huggingface.co/datasets/lmsys/chatbot_arena_conversations)
 - [openbmb/UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)