Spaces: nielsr/vilt-nlvr
nielsr HF staff committed on Dec 22, 2021

Commit edac551 • 1 Parent(s): 61ac553

Update app.py

Files changed (1)

app.py CHANGED

@@ -29,9 +29,9 @@ images = [gr.inputs.Image(type="pil"), gr.inputs.Image(type="pil")]
 text = gr.inputs.Textbox(lines=2, label="Sentence")
 answer = gr.outputs.Textbox(label="Predicted answer")
-example_sentence_1 = "One image contains twice the number of dogs as the other image, and at least two dogs in total are standing."
+example_sentence_1 = "The left image contains twice the number of dogs as the right image, and at least two dogs in total are standing."
 example_sentence_2 = "One image shows exactly two brown acorns in back-to-back caps on green foliage."
-examples = [["image1.jpg", "image2.jpg", example_sentence_1], ["image1.jpg", "image2.jpg", example_sentence_2]]
+examples = [["image1.jpg", "image2.jpg", example_sentence_1], ["image3.jpg", "image4.jpg", example_sentence_2]]
 title = "Interactive demo: natural language visual reasoning with ViLT"
 description = "Gradio Demo for ViLT (Vision and Language Transformer), fine-tuned on NLVR2. To use it, simply upload a pair of images and type a sentence and click 'submit', or click one of the examples to load them. The model will predict whether the sentence is true or false, based on the 2 images. Read more at the links below."
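
Each row of `examples` has to line up with the demo's inputs in order: two image paths, then the sentence. Before this commit both rows pointed at the same image pair. The sketch below is not part of app.py. It is a minimal, standalone check under stated assumptions: `check_example_rows`, `shared_image_pairs`, and `IMAGE_EXTENSIONS` are hypothetical names introduced here for illustration, and the accepted file suffixes are a guess.

```python
# Hypothetical sanity check for the `examples` list; not part of app.py.
example_sentence_1 = "The left image contains twice the number of dogs as the right image, and at least two dogs in total are standing."
example_sentence_2 = "One image shows exactly two brown acorns in back-to-back caps on green foliage."
examples = [
    ["image1.jpg", "image2.jpg", example_sentence_1],
    ["image3.jpg", "image4.jpg", example_sentence_2],
]

# Assumption: the demo only ships images with these suffixes.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")


def check_example_rows(rows):
    """Return a list of problems; an empty list means every row is well-formed."""
    problems = []
    for index, row in enumerate(rows):
        if len(row) != 3:
            problems.append(f"row {index}: expected 3 entries, got {len(row)}")
            continue
        first_image, second_image, sentence = row
        for path in (first_image, second_image):
            if not str(path).lower().endswith(IMAGE_EXTENSIONS):
                problems.append(f"row {index}: {path!r} is not an image path")
        if first_image == second_image:
            problems.append(f"row {index}: both images are the same file")
        if not isinstance(sentence, str) or not sentence.strip():
            problems.append(f"row {index}: sentence is empty")
    return problems


def shared_image_pairs(rows):
    """Return (earlier, later) index pairs of rows that reuse the same image pair."""
    seen = {}
    duplicates = []
    for index, row in enumerate(rows):
        pair = tuple(row[:2])
        if pair in seen:
            duplicates.append((seen[pair], index))
        else:
            seen[pair] = index
    return duplicates
```

Calling `shared_image_pairs` on the pre-commit list would flag rows 0 and 1 as reusing `image1.jpg` and `image2.jpg`, which is the situation this commit changes.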