DongfuJiang
commited on
Commit
•
3f7f343
1
Parent(s):
202f409
update
Browse files
app.py
CHANGED
@@ -109,7 +109,7 @@ def build_demo():
|
|
109 |
gr.Markdown(""" # Mantis
|
110 |
Mantis is a multimodal conversational AI model that can chat with users about images and text. It's optimized for multi-image reasoning, where inverleaved text and images can be used to generate responses.
|
111 |
|
112 |
-
[Paper](https://arxiv.org/abs/2405.01483) | [Github](https://github.com/TIGER-AI-Lab/Mantis) | [Models](https://huggingface.co/collections/TIGER-Lab/mantis-6619b0834594c878cdb1d6e4) | [Dataset](https://huggingface.co/datasets/TIGER-Lab/Mantis-Instruct)
|
113 |
""")
|
114 |
|
115 |
gr.Markdown("""## Chat with Mantis
|
@@ -183,6 +183,15 @@ Mantis is a multimodal conversational AI model that can chat with users about im
|
|
183 |
inputs=[chat_input],
|
184 |
)
|
185 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
186 |
return demo
|
187 |
|
188 |
|
|
|
109 |
gr.Markdown(""" # Mantis
|
110 |
Mantis is a multimodal conversational AI model that can chat with users about images and text. It's optimized for multi-image reasoning, where inverleaved text and images can be used to generate responses.
|
111 |
|
112 |
+
### [Paper](https://arxiv.org/abs/2405.01483) | [Github](https://github.com/TIGER-AI-Lab/Mantis) | [Models](https://huggingface.co/collections/TIGER-Lab/mantis-6619b0834594c878cdb1d6e4) | [Dataset](https://huggingface.co/datasets/TIGER-Lab/Mantis-Instruct)
|
113 |
""")
|
114 |
|
115 |
gr.Markdown("""## Chat with Mantis
|
|
|
183 |
inputs=[chat_input],
|
184 |
)
|
185 |
|
186 |
+
gr.Markdown("""
|
187 |
+
## Citation
|
188 |
+
```
|
189 |
+
@article{jiang2024mantis,
|
190 |
+
title={MANTIS: Interleaved Multi-Image Instruction Tuning},
|
191 |
+
author={Jiang, Dongfu and He, Xuan and Zeng, Huaye and Wei, Con and Ku, Max and Liu, Qian and Chen, Wenhu},
|
192 |
+
journal={arXiv preprint arXiv:2405.01483},
|
193 |
+
year={2024}
|
194 |
+
}```""")
|
195 |
return demo
|
196 |
|
197 |
|