Model taken from: https://github.com/facebookresearch/tart/tree/main Uploaded to HF for ease of use