Spaces:

DeepJudge
/

Applicant-Task-Submission

Sleeping

App Files Files

Timothy-Vinzent commited on Feb 19

Commit

1feb2ff

verified ·

1 Parent(s): 357fcb5

Update app.py

Browse files

Files changed (1) hide show

app.py +93 -0

app.py CHANGED Viewed

@@ -127,6 +127,99 @@ def build_interface():
     """
     with gr.Blocks() as demo:
         gr.Markdown("# GPT-4o Mini System Prompt Submission")
         gr.Markdown("""Classification Task: Document and Clause Level Identification
                     Challenge Description
                     Participants must create a system prompt for a language model that classifies user queries about legal documents into two specific categories:"

     """
     with gr.Blocks() as demo:
         gr.Markdown("# GPT-4o Mini System Prompt Submission")
+        # General description
+        gr.Markdown("""Classification Task: Document and Clause Level Identification
+        Participants must create a system prompt for a language model that classifies user queries about legal documents into two specific categories:
+        1. **Document Level**: Determines whether the query refers to a single document or multiple documents.
+        2. **Clause Level**: Identifies whether the query is focused on:
+            - A single clause,
+            - Multiple clauses, or
+            - General information not constrained to any specific clause.
+        The model must return a valid JSON object with the following structure:
+        ```
+        {
+          "document_level": "single/multiple",
+          "clause_level": "single/multiple/general"
+        }
+        ```
+        The goal is to ensure that the model's output is concise, structured, and accurate. This task is designed to evaluate the robustness of the system prompt in handling classification tasks with short, precise outputs.
+        """)
+        # Example Inputs and Outputs in an Accordion
+        with gr.Accordion("Example Inputs and Expected Outputs", open=False):
+            gr.Markdown("""
+            1. **User Message Example 1:**
+            - *"Please provide the contract for the lease agreement."*
+            - **Expected Output:**
+            ```
+            {"document_level": "single", "clause_level": "general"}
+            ```
+            2. **User Message Example 2:**
+            - *"I need all clauses related to termination in the employment contract."*
+            - **Expected Output:**
+            ```
+            {"document_level": "single", "clause_level": "multiple"}
+            ```
+            3. **User Message Example 3:**
+            - *"Can you send me the financial reports and the partnership agreement?"*
+            - **Expected Output:**
+            ```
+            {"document_level": "multiple", "clause_level": "general"}
+            ```
+            4. **User Message Example 4:**
+            - *"What are the key clauses in the NDA?"*
+            - **Expected Output:**
+            ```
+            {"document_level": "single", "clause_level": "multiple"}
+            ```
+            5. **User Message Example 5:**
+            - *"Tell me about the company’s financials."*
+            - **Expected Output:**
+            ```
+            {"document_level": "single", "clause_level": "general"}
+            ```
+            6. **User Message Example 6:**
+            - *"Provide all contracts and their confidentiality clauses."*
+            - **Expected Output:**
+            ```
+            {"document_level": "multiple", "clause_level": "multiple"}
+            ```
+            7. **User Message Example 7:**
+            - *"Extract the arbitration clause from this service agreement."*
+            - **Expected Output:**
+            ```
+            {"document_level": "single", "clause_level": "single"}
+            ```
+            """)
+        # Challenge instructions in another Accordion
+        with gr.Accordion("Challenge Instructions", open=False):
+            gr.Markdown("""
+            - Design a system prompt that ensures the AI generates outputs like those above when given similar user messages.
+              The system prompt should:
+              1. Specify formatting requirements (e.g., *"Output must be a valid JSON object"*). Note that we are not using constrained decoding or any sort of JSON mode; if not correctly prompted, the LLM will output plain text.
+              2. Emphasize strict adherence to classification definitions:
+                  - *Single Document:* Refers to one document.
+                  - *Multiple Documents:* Refers to more than one document.
+                  - *Single Clause:* Refers to one specific clause.
+                  - *Multiple Clauses:* Refers to more than one specific clause.
+                  - *General Information:* Refers to general content not tied to specific clauses.
+              You can only submit once, so test your system prompt thoroughly before submission!
+              """)
         gr.Markdown("""Classification Task: Document and Clause Level Identification
                     Challenge Description
                     Participants must create a system prompt for a language model that classifies user queries about legal documents into two specific categories:"