upload
Browse files- .gitattributes +1 -0
- README.md +34 -0
- llama-2-13b/params.json +3 -0
- sphinx-v/stage1/config.json +3 -0
- sphinx-v/stage1/consolidated.00-of-02.model.pth +3 -0
- sphinx-v/stage1/consolidated.01-of-02.model.pth +3 -0
- sphinx-v/stage1/meta.json +3 -0
- sphinx-v/stage1/tokenizer.model +3 -0
- sphinx-v/stage2/config.json +3 -0
- sphinx-v/stage2/consolidated.00-of-02.model.pth +3 -0
- sphinx-v/stage2/consolidated.01-of-02.model.pth +3 -0
- sphinx-v/stage2/meta.json +3 -0
- sphinx-v/stage2/tokenizer.model +3 -0
- tokenizer/tokenizer.model +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,3 +1,37 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
|
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
language:
|
4 |
+
- en
|
5 |
---
|
6 |
+
|
7 |
+
# SPHINX-V Model Card
|
8 |
+
|
9 |
+
## Model type:
|
10 |
+
**SPHINX-V** is a new multimodal large language model designed for visual prompting, equipped with a novel visual prompt encoder and a two-stage training strategy. SPHINX-V supports multiple visual prompts simultaneously across various types, significantly enhancing user flexibility and achieve a fine-grained and open-world understanding of visual prompts.
|
11 |
+
|
12 |
+
|
13 |
+
## Paper or resources for more information:
|
14 |
+
Project Page: [Home]() \
|
15 |
+
Paper: []() \
|
16 |
+
Code: [https://github.com/AFeng-x/Draw-and-Understand](https://github.com/AFeng-x/Draw-and-Understand) \
|
17 |
+
Dataset: [Draw-and-Understand](https://huggingface.co/datasets/Afeng-x/Draw-and-Understand)
|
18 |
+
|
19 |
+
|
20 |
+
## Intended use
|
21 |
+
**Primary intended uses:**
|
22 |
+
The principal application of SPHINX-V is centered around conducting research in the realm of visual prompting large multimodal models and chatbots.
|
23 |
+
|
24 |
+
**Primary intended users:**
|
25 |
+
The model is primarily designed for use by researchers and enthusiasts specializing in fields such as computer vision, natural language processing, and interactive artificial intelligence.
|
26 |
+
|
27 |
+
|
28 |
+
## License
|
29 |
+
Llama 2 is licensed under the LLAMA 2 Community License,
|
30 |
+
Copyright (c) Meta Platforms, Inc. All Rights Reserved.
|
31 |
+
|
32 |
+
|
33 |
+
## Citations
|
34 |
+
```
|
35 |
+
@misc{
|
36 |
+
}
|
37 |
+
```
|
llama-2-13b/params.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:713ac69eaa9387a19d25c10e03270de04a5fe4232f136751595683c26f6f996a
|
3 |
+
size 102
|
sphinx-v/stage1/config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1f80dcc4646093073333eddf877fc16708fac2eb1000375d2c52e6c26b2bab52
|
3 |
+
size 307
|
sphinx-v/stage1/consolidated.00-of-02.model.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5bdc834c1fd336c291d63764175aae9dd337fab5e530e3fb5c40d4b41da9ba03
|
3 |
+
size 19915449769
|
sphinx-v/stage1/consolidated.01-of-02.model.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f83eb677c401a6a66b42174358240ad9d2da8312b78bde882bd8afd3cebd343c
|
3 |
+
size 19915449769
|
sphinx-v/stage1/meta.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:26eec800d54c2dce59aaa825e3946e5ac96f8084ed9460c5d115cfe3a19528af
|
3 |
+
size 35
|
sphinx-v/stage1/tokenizer.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
|
3 |
+
size 499723
|
sphinx-v/stage2/config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:12866b022e943d665794df827d575020d5c4db70e3e66d1f9c590d6cdb07b211
|
3 |
+
size 307
|
sphinx-v/stage2/consolidated.00-of-02.model.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3831753397639c88e652577cfc9d258363ec1b828886a3d1ff448cd6e2f16736
|
3 |
+
size 19915442281
|
sphinx-v/stage2/consolidated.01-of-02.model.pth
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6ed36d752126fcc60b98210cf524c40d3ebc40eb81956a8fa09ebc8b4de7afad
|
3 |
+
size 19915442281
|
sphinx-v/stage2/meta.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:26eec800d54c2dce59aaa825e3946e5ac96f8084ed9460c5d115cfe3a19528af
|
3 |
+
size 35
|
sphinx-v/stage2/tokenizer.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
|
3 |
+
size 499723
|
tokenizer/tokenizer.model
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
|
3 |
+
size 499723
|