Update README.md

README.md (CHANGED):
@@ -3,7 +3,7 @@ license: mit
 ---
 # 🔥 SPHINX: A Mixer of Tasks, Domains, and Embeddings
 
-Official implementation of ['SPHINX: A Mixer of Tasks, Domains, and Embeddings Advances Multi-modal Large Language Models']().
+Official implementation of ['SPHINX: A Mixer of Tasks, Domains, and Embeddings Advances Multi-modal Large Language Models'](https://github.com/Alpha-VLLM/LLaMA2-Accessory/tree/main/SPHINX).
 
 Try out our [web demo 🚀](http://imagebind-llm.opengvlab.com/) here!
 
@@ -20,23 +20,14 @@ We present $\color{goldenrod}{SPHINX}$, a versatile multi-modal large language m
 
 - **Domain Mix.** For data from real-world and synthetic domains, we mix the weights of two domain-specific models for complementarity.
 
-<p align="center">
-</p>
-<p align="center">
-
-## Demo
-Via our proposed three-fold mixer, $\color{goldenrod}{SPHINX}$ exhibits superior multi-modal understanding and reasoning powers.
-<p align="center"> <img src="figs/1.png" width="70%"> <br>
-</p>
-<p align="center"> <img src="figs/2.png" width="70%"> <br>
-</p>
-<p align="center"> <img src="figs/3.png" width="70%"> <br>
-</p>
-<p align="center"> <img src="figs/4.png" width="50%"> <br>
-</p>
-<p align="center"> <img src="figs/5.png" width="60%"> <br>
-</p>
+<p align="center">
+<img src="https://github.com/Alpha-VLLM/LLaMA2-Accessory/blob/main/SPHINX/figs/pipeline.png" width="90%"> <br>
+</p>
+<p align="center">
+<img src="https://github.com/Alpha-VLLM/LLaMA2-Accessory/blob/main/SPHINX/figs/pipeline2.png" width="90%"> <br>
+</p>
+
 
 ## Inference
 This section provides a step-by-step guide for hosting a local SPHINX demo. If you're already familiar with the LLaMA2-Accessory toolkit, note that hosting a SPHINX demo follows the same pipeline as hosting demos for the other models supported by LLaMA2-Accessory.
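The Domain Mix bullet above describes merging two domain-specific models by mixing their weights. A minimal sketch of that kind of parameter-space interpolation, assuming two checkpoints with identical architectures represented as parameter dicts; `mix_weights` and the `alpha` blending ratio are illustrative assumptions, not the official SPHINX implementation:

```python
def mix_weights(state_a, state_b, alpha=0.5):
    """Blend two models' parameters: w = alpha * w_a + (1 - alpha) * w_b.

    state_a / state_b map parameter names to values (scalars here; in a
    real checkpoint they would be tensors with matching shapes).
    """
    assert state_a.keys() == state_b.keys(), "models must share an architecture"
    return {k: alpha * state_a[k] + (1 - alpha) * state_b[k] for k in state_a}


# Toy example with scalar "parameters" standing in for weight tensors:
real_domain = {"layer.weight": 1.0, "layer.bias": 0.0}
synthetic_domain = {"layer.weight": 3.0, "layer.bias": 2.0}
mixed = mix_weights(real_domain, synthetic_domain, alpha=0.5)
# mixed["layer.weight"] == 2.0, mixed["layer.bias"] == 1.0
```

With alpha=0.5 this is a plain average of the two checkpoints; other ratios weight one domain more heavily.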