upload

Files changed (14) hide show

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.json filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -1,3 +1,37 @@
 ---
 license: apache-2.0
 ---

 ---
 license: apache-2.0
+language:
+- en
 ---
+# SPHINX-V Model Card
+## Model type:
+**SPHINX-V** is a new multimodal large language model designed for visual prompting, equipped with a novel visual prompt encoder and a two-stage training strategy. SPHINX-V supports multiple visual prompts simultaneously across various types, significantly enhancing user flexibility and achieve a fine-grained and open-world understanding of visual prompts.
+## Paper or resources for more information:
+Project Page: [Home]() \
+Paper: []() \
+Code: [https://github.com/AFeng-x/Draw-and-Understand](https://github.com/AFeng-x/Draw-and-Understand) \
+Dataset: [Draw-and-Understand](https://huggingface.co/datasets/Afeng-x/Draw-and-Understand)
+## Intended use
+**Primary intended uses:**
+The principal application of SPHINX-V is centered around conducting research in the realm of visual prompting large multimodal models and chatbots.
+**Primary intended users:**
+The model is primarily designed for use by researchers and enthusiasts specializing in fields such as computer vision, natural language processing, and interactive artificial intelligence.
+## License
+Llama 2 is licensed under the LLAMA 2 Community License,
+Copyright (c) Meta Platforms, Inc. All Rights Reserved.
+## Citations
+```
+@misc{
+}
+```

llama-2-13b/params.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:713ac69eaa9387a19d25c10e03270de04a5fe4232f136751595683c26f6f996a
+size 102

sphinx-v/stage1/config.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1f80dcc4646093073333eddf877fc16708fac2eb1000375d2c52e6c26b2bab52
+size 307

sphinx-v/stage1/consolidated.00-of-02.model.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5bdc834c1fd336c291d63764175aae9dd337fab5e530e3fb5c40d4b41da9ba03
+size 19915449769

sphinx-v/stage1/consolidated.01-of-02.model.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:f83eb677c401a6a66b42174358240ad9d2da8312b78bde882bd8afd3cebd343c
+size 19915449769

sphinx-v/stage1/meta.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:26eec800d54c2dce59aaa825e3946e5ac96f8084ed9460c5d115cfe3a19528af
+size 35

sphinx-v/stage1/tokenizer.model ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
+size 499723

sphinx-v/stage2/config.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:12866b022e943d665794df827d575020d5c4db70e3e66d1f9c590d6cdb07b211
+size 307

sphinx-v/stage2/consolidated.00-of-02.model.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:3831753397639c88e652577cfc9d258363ec1b828886a3d1ff448cd6e2f16736
+size 19915442281

sphinx-v/stage2/consolidated.01-of-02.model.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:6ed36d752126fcc60b98210cf524c40d3ebc40eb81956a8fa09ebc8b4de7afad
+size 19915442281

sphinx-v/stage2/meta.json ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:26eec800d54c2dce59aaa825e3946e5ac96f8084ed9460c5d115cfe3a19528af
+size 35

sphinx-v/stage2/tokenizer.model ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
+size 499723

tokenizer/tokenizer.model ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
+size 499723