remyxai
/

SpaceFlorence-2

Model card Files Files and versions Community

salma-remyx commited on Aug 16, 2024

Commit

870a1f9

·

verified ·

1 Parent(s): 8a5f18e

Update README.md

Files changed (1) hide show

README.md +21 -1

README.md CHANGED Viewed

@@ -15,4 +15,24 @@ datasets:
 ### Model Sources
 - **Dataset:** [SpaceLLaVA](https://huggingface.co/datasets/remyxai/vqasynth_spacellava)
 - **Repository:** [VQASynth](https://github.com/remyxai/VQASynth/tree/main)
-- **Paper:** [SpatialVLM](https://arxiv.org/abs/2401.12168)

 ### Model Sources
 - **Dataset:** [SpaceLLaVA](https://huggingface.co/datasets/remyxai/vqasynth_spacellava)
 - **Repository:** [VQASynth](https://github.com/remyxai/VQASynth/tree/main)
+- **Paper:** [SpatialVLM](https://arxiv.org/abs/2401.12168)
+## Citation
+```
+@article{chen2024spatialvlm,
+  title = {SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities},
+  author = {Chen, Boyuan and Xu, Zhuo and Kirmani, Sean and Ichter, Brian and Driess, Danny and Florence, Pete and Sadigh, Dorsa and Guibas, Leonidas and Xia, Fei},
+  journal = {arXiv preprint arXiv:2401.12168},
+  year = {2024},
+  url = {https://arxiv.org/abs/2401.12168},
+}
+@article{xiao2023florence,
+  title={Florence-2: Advancing a unified representation for a variety of vision tasks},
+  author={Xiao, Bin and Wu, Haiping and Xu, Weijian and Dai, Xiyang and Hu, Houdong and Lu, Yumao and Zeng, Michael and Liu, Ce and Yuan, Lu},
+  journal={arXiv preprint arXiv:2311.06242},
+  year={2023}
+}
+```