license: apache-2.0
Multiformer
Multiformer is a multi-task vision transformer designed to provide strong perception capabilities in a nimble, lightweight architecture.
This model uses a custom branch of the transformers library, which can be installed by following the setup instructions below.
For training and evaluation, a custom MultitaskTrainer class handles complex nested losses and logs them to wandb.
A training/evaluation script and an inference script are both available in the project repository.
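To make the idea of nested losses concrete, here is a minimal sketch of how a per-task loss dictionary could be flattened into flat keys for logging and summed into one scalar. This is not the MultitaskTrainer implementation: the helper names (`flatten_losses`, `total_loss`), the task names, and the loss values are all hypothetical and chosen only for illustration.

```python
from collections.abc import Mapping


def flatten_losses(losses, prefix="", sep="/"):
    """Flatten a nested loss mapping into {"task/sub_loss": value} pairs."""
    flat = {}
    for name, value in losses.items():
        key = f"{prefix}{sep}{name}" if prefix else name
        if isinstance(value, Mapping):
            # Recurse into per-task sub-losses, carrying the task name as a prefix.
            flat.update(flatten_losses(value, prefix=key, sep=sep))
        else:
            flat[key] = float(value)
    return flat


def total_loss(losses):
    """Sum every leaf of the nested loss mapping (unweighted)."""
    return sum(flatten_losses(losses).values())


# Hypothetical task names and values, not taken from the repository.
example_losses = {
    "semseg": 0.5,
    "depth": {"silog": 0.25, "l1": 0.125},
    "detection": {"loss_ce": 1.0, "loss_bbox": 0.5, "loss_giou": 0.25},
}

flat_losses = flatten_losses(example_losses)
combined = total_loss(example_losses)
```

A flat dictionary like `flat_losses` is the kind of object that can be passed to `wandb.log`, which is one plausible way to get a separate curve per sub-loss. An unweighted sum is used here only for simplicity; the actual trainer may weight tasks differently.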
Setup Instructions
- Open a terminal, navigate to your root folder, and run:
git clone https://github.com/FoamoftheSea/shift-experiments.git
- Follow the setup instructions for your operating system in the repository's README
Quick Model Import
You should now be able to run the following code to load a Multiformer-M0 with pretrained weights:
from transformers import AutoModel
multiformer = AutoModel.from_pretrained("FoamoftheSea/multiformer-m0")
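As a quick sanity check after loading, you may want to see how large the model is. The sketch below is one way to do that, assuming the custom transformers branch is installed and the model loads as a PyTorch module. The helper names `count_parameters` and `load_and_count` are hypothetical and are not part of the project repository.

```python
def count_parameters(model, trainable_only=False):
    """Count parameters of any object exposing a PyTorch-style .parameters() iterator."""
    return sum(
        p.numel()
        for p in model.parameters()
        if p.requires_grad or not trainable_only
    )


def load_and_count(model_id="FoamoftheSea/multiformer-m0"):
    """Load the pretrained model and return it together with its parameter count."""
    # Imported lazily so that defining these helpers does not require
    # the custom transformers branch to be installed.
    from transformers import AutoModel

    model = AutoModel.from_pretrained(model_id)
    return model, count_parameters(model)
```

Calling `multiformer, n_params = load_and_count()` downloads the weights on first use, so it needs network access. `count_parameters(multiformer, trainable_only=True)` restricts the count to parameters with `requires_grad` set.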