Text Generation
Transformers
Safetensors
English
deberta
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
Dongfu Jiang commited on
Commit
5b880cc
1 Parent(s): d6bc040

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -3,7 +3,7 @@ license: mit
3
  datasets:
4
  - openai/summarize_from_feedback
5
  - openai/webgpt_comparisons
6
- - Dahoas/instruct-synthetic-prompt-responses
7
  - Anthropic/hh-rlhf
8
  - lmsys/chatbot_arena_conversations
9
  - openbmb/UltraFeedback
@@ -199,7 +199,7 @@ Learn more in our LLM-Blender Github [README.md](https://github.com/yuchenlin/LL
199
  ### Training Datasets
200
  - [openai/summarize_from_feedback](https://huggingface.co/datasets/openai/summarize_from_feedback)
201
  - [openai/webgpt_comparisons](https://huggingface.co/datasets/openai/webgpt_comparisons)
202
- - [Dahoas/instruct-synthetic-prompt-responses](https://huggingface.co/datasets/Dahoas/instruct-synthetic-prompt-responses)
203
  - [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf)
204
  - [lmsys/chatbot_arena_conversations](https://huggingface.co/datasets/lmsys/chatbot_arena_conversations)
205
  - [openbmb/UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)
 
3
  datasets:
4
  - openai/summarize_from_feedback
5
  - openai/webgpt_comparisons
6
+ - Dahoas/synthetic-instruct-gptj-pairwise
7
  - Anthropic/hh-rlhf
8
  - lmsys/chatbot_arena_conversations
9
  - openbmb/UltraFeedback
 
199
  ### Training Datasets
200
  - [openai/summarize_from_feedback](https://huggingface.co/datasets/openai/summarize_from_feedback)
201
  - [openai/webgpt_comparisons](https://huggingface.co/datasets/openai/webgpt_comparisons)
202
+ - [Dahoas/synthetic-instruct-gptj-pairwise](https://huggingface.co/datasets/Dahoas/synthetic-instruct-gptj-pairwise)
203
  - [Anthropic/hh-rlhf](https://huggingface.co/datasets/Anthropic/hh-rlhf)
204
  - [lmsys/chatbot_arena_conversations](https://huggingface.co/datasets/lmsys/chatbot_arena_conversations)
205
  - [openbmb/UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback)