ContextualAI
/

Contextual_KTO_Mistral_PairRM

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

xwinxu commited on Mar 5

Commit

06fc6e3

•

1 Parent(s): d8380f4

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -12,9 +12,7 @@ tags:
 - dpo
 - rl
 datasets:
-- stanfordnlp/SHP
-- Anthropic/hh-rlhf
-- OpenAssistant/oasst1
 metrics:
 - accuracy
 ---
@@ -30,6 +28,7 @@ To prompt Archangel models, ensure that the format is consistent with that of Tu
 For example, a prompt should be formatted as follows, where `<|user|>` corresponds to the human's role and `<|assistant|>` corresponds to the LLM's role.
 The human should speak first:
 ```
 <|user|>
 Hi! I'm looking for a cake recipe.
 <|assistant|>

 - dpo
 - rl
 datasets:
+- snorkelai/Snorkel-Mistral-PairRM-DPO-Dataset
 metrics:
 - accuracy
 ---
 For example, a prompt should be formatted as follows, where `<|user|>` corresponds to the human's role and `<|assistant|>` corresponds to the LLM's role.
 The human should speak first:
 ```
 <|user|>
 Hi! I'm looking for a cake recipe.
 <|assistant|>