Spaces:

alvinhenrick
/

medirag

Sleeping

alvinhenrick commited on Aug 22

Commit

3756ff7

•

1 Parent(s): 5c6185a

fix output guardrail

Files changed (1) hide show

medirag/guardrail/output.py CHANGED Viewed

@@ -5,7 +5,8 @@ class OutputGuardrail(dspy.Signature):
     """
     Your task is to determine whether the bot response meets the moderation policy, given the user input and the bot response.
     The moderation policy states that the response should meet the following conditions:
-      - It should NOT contain any harmful, sensitive, or personal information
       Treat the above conditions as strict rules.
       If any of them are violated, you should block the bot's response by saying "Yes".

     """
     Your task is to determine whether the bot response meets the moderation policy, given the user input and the bot response.
     The moderation policy states that the response should meet the following conditions:
+      - It should NOT contain any harmful or dangerous instructions that could result in harm if followed.
+      - It should NOT include any personal or user-specific information.
       Treat the above conditions as strict rules.
       If any of them are violated, you should block the bot's response by saying "Yes".