Model
To add a new model, you need to modify the following parts of the code:
- Modify
configuration_*.py
file, especially thefrom_*
static method. - Add a
modeling_*.py
files. Note ahout thefrom_*
static method, which calls thefrom_*
static method fromconfiguration_*.py
. - Add an argument class in
arguments.py
accordingly. - Add the new model in
__init__.py
. - Import the new model and argument class in the main script like
train.py
, and callfrom_*
according to its parameters. - May need to add processor.
Architecture
Multitaskv2
base_sca_multitask_v2
It uses task_type
to activate different task tokens, which are recognition
and caption
DirectDecodingv2 (MultitaskV2)
base_sca_direct_decoding_v2
Like Multitaskv2, but the caption tokens are the query tokens of SAM.
SplitMixer (Multitaskv2)
base_sca_multitask_split_mixer
Like Multitaskv2, but it does not based on the fused tokens from SAM's feature mixer.
ROI Pooler (Multitaskv2)
Other Image features (Multitaskv2)
Inputs and Outputs
SCA trainer requires that every items in logits
should not be None
.
When it gathers the results across devices during inference, it calls self._pad_across_processes
which recursively concatenates tensors.
Attributes and Methods
TBD
HF Trainer Adaption
TBD
SAM Models
https://huggingface.co/facebook/sam-vit-base
- facebook/sam-vit-base
https://huggingface.co/facebook/sam-vit-large
- facebook/sam-vit-large
https://huggingface.co/facebook/sam-vit-huge
- facebook/sam-vit-huge
Language Models
https://huggingface.co/gpt2-large
- gpt2-large
https://huggingface.co/openlm-research/open_llama_3b_v2
- openlm-research/open_llama_3b_v2
https://huggingface.co/stabilityai/stablelm-3b-4e1t
- stabilityai/stablelm-3b-4e1t
https://huggingface.co/stabilityai/stablelm-zephyr-3b
- stabilityai/stablelm-zephyr-3b
- modle after SFT and RLAIF
- the tokenizer is update from
GPTNeoXTokenizer
- Need the latest version of transformers.
https://huggingface.co/microsoft/phi-2
- microsoft/phi-2
- d3186761bf5c4409f7679359284066c25ab668ee
https://huggingface.co/HuggingFaceH4/zephyr-7b-beta
- HuggingFaceH4/zephyr-7b-beta
https://huggingface.co/HuggingFaceH4/zephyr-7b-alpha
- HuggingFaceH4/zephyr-7b-alpha
https://huggingface.co/mistralai/Mistral-7B-v0.1
- mistralai/Mistral-7B-v0.1
https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1
- mistralai/Mistral-7B-Instruct-v0.1
https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2
- mistralai/Mistral-7B-Instruct-v0.2