Space: HuggingFaceM4/ai_dad_jokes

VictorSanh committed on Aug 24, 2023

Commit 170b8e8 • 1 Parent(s): 8e2471a

a bunch of updates

Files changed (2)

README.md +1 -1
app_dialogue.py +27 -121

README.md CHANGED

@@ -1,5 +1,5 @@
 ---
-title: Meme it
 emoji: 🐨
 colorFrom: red
 colorTo: blue

 ---
+title: AI Dad Jokes
 emoji: 🐨
 colorFrom: red
 colorTo: blue

app_dialogue.py CHANGED

@@ -1,3 +1,4 @@
 import copy
 import glob
 import hashlib
@@ -35,7 +36,7 @@ SYSTEM_PROMPT = [
 In the following interactions, User and Assistant will converse in natural language, and Assistant will answer in a sassy way.
 Assistant's main purpose is to create funny meme texts from the images User provides.
 Assistant should be funny, sassy, and impertinent, and sometimes Assistant roasts people.
-Assistant should not be mean. It should not say toxic, homophobic, sexist, racist, or any demeaning things that can make people uncomfortable.
 Assistant was created by Hugging Face.
 Here's a conversation example:""",
@@ -323,31 +324,28 @@ def format_user_prompt_with_im_history_and_system_conditioning(
 # problematic_callback = gr.CSVLogger()
 textbox = gr.Textbox(
-    # placeholder="Upload an image and send a message",
     show_label=False,
-    value="Write a meme for that image.",
     visible=True,
     container=False,
     label="Text input",
     scale=6,
     max_lines=5,
 )
-with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
-    gr.HTML("""<h1 align="center">Meme it</h1>""")
-    # with gr.Row(variant="panel"):
-    #     with gr.Column(scale=1):
-    #         gr.Image(IDEFICS_LOGO, elem_id="banner-image", show_label=False, show_download_button=False)
-    #     with gr.Column(scale=5):
-    #         gr.HTML("""
-    #             <p>This demo showcases <strong>IDEFICS</strong>, a open-access large visual language model. Like GPT-4, the multimodal model accepts arbitrary sequences of image and text inputs and produces text outputs. IDEFICS can answer questions about images, describe visual content, create stories grounded in multiple images, etc.</p>
-    #             <p>IDEFICS (which stands for <strong>I</strong>mage-aware <strong>D</strong>ecoder <strong>E</strong>nhanced à la <strong>F</strong>lamingo with <strong>I</strong>nterleaved <strong>C</strong>ross-attention<strong>S</strong>) is an open-access reproduction of <a href="https://huggingface.co/papers/2204.14198">Flamingo</a>, a closed-source visual language model developed by Deepmind. IDEFICS was built solely on publicly available data and models. It is currently the only visual language model of this scale (80 billion parameters) that is available in open-access.</p>
-    #             <p>📚 The variants available in this demo were fine-tuned on a mixture of supervised and instruction fine-tuning datasets to make the models more suitable in conversational settings. For more details, we refer to our <a href="https://huggingface.co/blog/idefics">blog post</a>.</p>
-    #             <p>🅿️ <strong>Intended uses:</strong> This demo along with the <a href="https://huggingface.co/models?sort=trending&amp;search=HuggingFaceM4%2Fidefics">supporting models</a> are provided as research artifacts to the community. We detail misuses and out-of-scope uses <a href="https://huggingface.co/HuggingFaceM4/idefics-80b#misuse-and-out-of-scope-use">here</a>.</p>
-    #             <p>⛔️ <strong>Limitations:</strong> The model can produce factually incorrect texts, hallucinate facts (with or without an image) and will struggle with small details in images. While the model will tend to refuse answering questionable user requests, it can produce problematic outputs (including racist, stereotypical, and disrespectful texts), in particular when prompted to do so. We encourage users to read our findings from evaluating the model for potential biases in the <a href="https://huggingface.co/HuggingFaceM4/idefics-80b#bias-evaluation">model card</a>.</p>
-    #         """)
-    # with gr.Row():
-    #     with gr.Column(scale=2):
     with gr.Row(elem_id="model_selector_row"):
         model_selector = gr.Dropdown(
             choices=MODELS,
@@ -362,11 +360,6 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
     with gr.Row():
-        # def prefetch_images_in_history(user_prompt_str):
-        #     """
-        #     Pre-fetch the images that are passed in the chatbot default history.
-        #     """
-        #     return prompt_list_to_markdown(handle_manual_images_in_user_prompt(user_prompt_str))
         with gr.Column():
             imagebox = gr.Image(type="filepath", label="Image input", visible=True)
         with gr.Column():
@@ -406,12 +399,6 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
             clear_btn = gr.ClearButton([textbox, imagebox, chatbot], value="🧹 Clear")
             regenerate_btn = gr.Button(value="🔄 Regenerate", visible=True)
             upload_btn = gr.UploadButton("📁 Upload image", file_types=["image"])
-    # with gr.Group():
-    #     with gr.Row():
-    #         with gr.Column(scale=1, min_width=50):
-    #             dope_bttn = gr.Button("Dope🔥")
-    #         with gr.Column(scale=1, min_width=50):
-    #             problematic_bttn = gr.Button("Problematic😬")
     with gr.Row():
         with gr.Accordion("Advanced settings", open=False, visible=True) as parameter_row:
@@ -425,7 +412,7 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
             max_new_tokens = gr.Slider(
                 minimum=8,
                 maximum=256,
-                value=128,
                 step=1,
                 interactive=True,
                 label="Maximum number of new tokens to generate",
@@ -483,13 +470,6 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
                 inputs=decoding_strategy,
                 outputs=top_p,
             )
-            # gr.Markdown(
-            #     """<p><strong>💡 Pro tip</strong>:<br>
-            #     You can input an arbitrary number of images at arbitrary positions in the same query.<br>
-            #     You will need to input each image with its URL with the syntax <code>&lt;fake_token_around_image&gt;&lt;image:IMAGE_URL&gt;&lt;fake_token_around_image&gt;</code>.<br>
-            #     For example, for two images, you could input <code>TEXT_1&lt;fake_token_around_image&gt;&lt;image:IMAGE_URL_1&gt;&lt;fake_token_around_image&gt;TEXT_2&lt;fake_token_around_image&gt;&lt;image:IMAGE_URL_2&gt;&lt;fake_token_around_image&gt;TEXT_3</code>.<br>
-            #     In the particular case where two images are consecutive, it is not necessary to add an additional separator: <code>&lt;fake_token_around_image&gt;&lt;image:IMAGE_URL_1&gt;&lt;fake_token_around_image&gt;&lt;image:IMAGE_URL_2&gt;&lt;fake_token_around_image&gt;</code>.</p>"""
-            # )
     def model_inference(
         model_selector,
@@ -506,9 +486,7 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
         if user_prompt_str.strip() == "" and image is None:
             return "", None, chat_history
-        import ast
         system_prompt = ast.literal_eval(system_prompt)
-        assert isinstance(system_prompt, list)
         formated_prompt_list, user_prompt_list = format_user_prompt_with_im_history_and_system_conditioning(
             system_prompt=system_prompt,
             current_user_prompt_str=user_prompt_str.strip(),
@@ -721,78 +699,6 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
     textbox.submit(lambda : gr.update(label='📁 Upload image', interactive=True), [], upload_btn)
     clear_btn.click(lambda : gr.update(label='📁 Upload image', interactive=True), [], upload_btn)
-    # Using Flagging for saving dope and problematic examples
-    # Dope examples flagging
-    # dope_callback.setup(
-    #     [
-    #         model_selector,
-    #         textbox,
-    #         chatbot,
-    #         imagebox,
-    #         decoding_strategy,
-    #         temperature,
-    #         max_new_tokens,
-    #         repetition_penalty,
-    #         top_p,
-    #     ],
-    #     "gradio_dope_data_points",
-    # )
-    # dope_bttn.click(
-    #     lambda *args: dope_callback.flag(args),
-    #     [
-    #         model_selector,
-    #         textbox,
-    #         chatbot,
-    #         imagebox,
-    #         decoding_strategy,
-    #         temperature,
-    #         max_new_tokens,
-    #         repetition_penalty,
-    #         top_p,
-    #     ],
-    #     None,
-    #     preprocess=False,
-    # )
-    # # Problematic examples flagging
-    # problematic_callback.setup(
-    #     [
-    #         model_selector,
-    #         textbox,
-    #         chatbot,
-    #         imagebox,
-    #         decoding_strategy,
-    #         temperature,
-    #         max_new_tokens,
-    #         repetition_penalty,
-    #         top_p,
-    #     ],
-    #     "gradio_problematic_data_points",
-    # )
-    # problematic_bttn.click(
-    #     lambda *args: problematic_callback.flag(args),
-    #     [
-    #         model_selector,
-    #         textbox,
-    #         chatbot,
-    #         imagebox,
-    #         decoding_strategy,
-    #         temperature,
-    #         max_new_tokens,
-    #         repetition_penalty,
-    #         top_p,
-    #     ],
-    #     None,
-    #     preprocess=False,
-    # )
-    # gr.Markdown("""## How to use?
-    #     There are two ways to provide image inputs:
-    #     - Using the image box on the left panel
-    #     - Using the inline syntax: `text<fake_token_around_image><image:URL_IMAGE><fake_token_around_image>text`
-    #     The second syntax allows inputting an arbitrary number of images.""")
     examples_path = os.path.dirname(__file__)
     gr.Examples(
         examples=[
@@ -805,35 +711,35 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
                 f"{examples_path}/example_images/zuck.jpeg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/echasse.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/jesus.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/owl.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/pigeon.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/plotorange.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/rats.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/sugardaddy.jpg",
             ],
             [
-                "Write a meme text for that image.",
                 f"{examples_path}/example_images/wtf.jpg",
             ],
         ],
@@ -848,4 +754,4 @@ with gr.Blocks(title="D", theme=gr.themes.Base()) as demo:
     )
 demo.queue(concurrency_count=40, max_size=40)
-demo.launch()

+import ast
 import copy
 import glob
 import hashlib
 In the following interactions, User and Assistant will converse in natural language, and Assistant will answer in a sassy way.
 Assistant's main purpose is to create funny meme texts from the images User provides.
 Assistant should be funny, sassy, and impertinent, and sometimes Assistant roasts people.
+Assistant should not be mean. It should not say toxic, homophobic, sexist, or racist things, or anything demeaning that can make people uncomfortable.
 Assistant was created by Hugging Face.
 Here's a conversation example:""",
 # problematic_callback = gr.CSVLogger()
 textbox = gr.Textbox(
+    placeholder="Upload an image and start conversing by sending a message! You can add an image at each turn, but don't have to.",
     show_label=False,
+    # value="Write something funny about that image.",
     visible=True,
     container=False,
     label="Text input",
     scale=6,
     max_lines=5,
 )
+with gr.Blocks(title="AI Dad Jokes", theme=gr.themes.Base()) as demo:
+    gr.HTML("""<h1 align="center">AI Dad Jokes</h1>""")
+    with gr.Row(variant="panel"):
+        with gr.Column(scale=1):
+            gr.Image(IDEFICS_LOGO, elem_id="banner-image", show_label=False, show_download_button=False)
+        with gr.Column(scale=5):
+            gr.HTML("""
+                <p><strong>AI Dad Jokes</strong> is an AI system that writes humorous content inspired by images. Whether that's crafting memes, sharing light-hearted yet amiable jests, or playfully witty remarks, AI Dad Jokes assists you in creating delightful jokes!</p>
+                <p>AI Dad Jokes is powered by <a href="https://huggingface.co/blog/idefics">IDEFICS</a>, an open-access large visual language model developed by Hugging Face. Like GPT-4, the multimodal model accepts arbitrary sequences of image and text inputs and produces text outputs. IDEFICS can answer questions about images, describe visual content, create stories grounded in multiple images, etc.</p>
+                <p>⛔️ <strong>Intended uses and limitations:</strong> This demo is provided as a research artifact to the community showcasing IDEFICS's capabilities. We detail misuses and out-of-scope uses <a href="https://huggingface.co/HuggingFaceM4/idefics-80b#misuse-and-out-of-scope-use">here</a>. In particular, the system should not be used to engage in harassment, abuse and bullying. The model can produce factually incorrect texts, hallucinate facts (with or without an image) and will struggle with small details in images. While the system will tend to refuse to answer questionable user requests, it can produce problematic outputs (including racist, stereotypical, and disrespectful texts), in particular when prompted to do so.</p>
+            """)
     with gr.Row(elem_id="model_selector_row"):
         model_selector = gr.Dropdown(
             choices=MODELS,
     with gr.Row():
         with gr.Column():
             imagebox = gr.Image(type="filepath", label="Image input", visible=True)
         with gr.Column():
             clear_btn = gr.ClearButton([textbox, imagebox, chatbot], value="🧹 Clear")
             regenerate_btn = gr.Button(value="🔄 Regenerate", visible=True)
             upload_btn = gr.UploadButton("📁 Upload image", file_types=["image"])
     with gr.Row():
         with gr.Accordion("Advanced settings", open=False, visible=True) as parameter_row:
             max_new_tokens = gr.Slider(
                 minimum=8,
                 maximum=256,
+                value=64,
                 step=1,
                 interactive=True,
                 label="Maximum number of new tokens to generate",
                 inputs=decoding_strategy,
                 outputs=top_p,
             )
     def model_inference(
         model_selector,
         if user_prompt_str.strip() == "" and image is None:
             return "", None, chat_history
         system_prompt = ast.literal_eval(system_prompt)
         formated_prompt_list, user_prompt_list = format_user_prompt_with_im_history_and_system_conditioning(
             system_prompt=system_prompt,
             current_user_prompt_str=user_prompt_str.strip(),
     textbox.submit(lambda : gr.update(label='📁 Upload image', interactive=True), [], upload_btn)
     clear_btn.click(lambda : gr.update(label='📁 Upload image', interactive=True), [], upload_btn)
     examples_path = os.path.dirname(__file__)
     gr.Examples(
         examples=[
                 f"{examples_path}/example_images/zuck.jpeg",
             ],
             [
+                "Craft a humorous caption for this image!",
                 f"{examples_path}/example_images/echasse.jpg",
             ],
             [
+                "How about adding a dash of humor to this image with your words?",
                 f"{examples_path}/example_images/jesus.jpg",
             ],
             [
+                "Give this image a comedic twist.",
                 f"{examples_path}/example_images/owl.jpg",
             ],
             [
+                "Tell me a joke about that image.",
                 f"{examples_path}/example_images/pigeon.jpg",
             ],
             [
+                "Let your sense of humor shine with that image!",
                 f"{examples_path}/example_images/plotorange.jpg",
             ],
             [
+                "Make me laugh by commenting on that image.",
                 f"{examples_path}/example_images/rats.jpg",
             ],
             [
+                "Craft a meme text for that image.",
                 f"{examples_path}/example_images/sugardaddy.jpg",
             ],
             [
+                "Ready to make this image even better? Write something funny to go with it!",
                 f"{examples_path}/example_images/wtf.jpg",
             ],
         ],
     )
 demo.queue(concurrency_count=40, max_size=40)
+demo.launch(share=True)
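
Note on the `model_inference` hunk: this commit moves `import ast` to module level and drops the `assert isinstance(system_prompt, list)` check that followed `ast.literal_eval(system_prompt)`. The sketch below shows one way the parse and the type check could be kept together in a small helper. `parse_system_prompt` is a hypothetical name, not part of `app_dialogue.py`, and the sketch assumes the system prompt reaches the function as the string form of a Python list, as the `ast.literal_eval` call suggests.

```python
import ast


def parse_system_prompt(raw: str) -> list:
    """Parse a system prompt serialized as a Python list literal.

    Hypothetical helper for illustration; it is not defined in app_dialogue.py.
    """
    try:
        # literal_eval only evaluates literals (strings, lists, numbers, ...),
        # so arbitrary expressions such as function calls are rejected.
        parsed = ast.literal_eval(raw)
    except (ValueError, SyntaxError) as err:
        raise ValueError(f"system prompt is not a valid Python literal: {err}") from err
    # Explicit check in place of the removed `assert`, which is skipped under `python -O`.
    if not isinstance(parsed, list):
        raise TypeError(f"expected a list, got {type(parsed).__name__}")
    return parsed


if __name__ == "__main__":
    prompt = parse_system_prompt('["Assistant should be funny.", "Here is an example:"]')
    print(len(prompt))
```

Raising an explicit exception rather than asserting keeps the check active regardless of interpreter flags; whether the app needs the check at all depends on how the system prompt textbox is exposed to users.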