Can you provide the training code for this?

#3
by Aravindan - opened

We are looking for training code for our use.

You can take a look at this demo notebook, illustrating fine-tuning this model on custom data: https://github.com/NielsRogge/Transformers-Tutorials/blob/master/ViLT/Fine_tuning_ViLT_for_VQA.ipynb

This code is showing error when running the line
encoding = processor.feature_extractor.pad_and_create_pixel_mask(pixel_values, return_tensors="pt")
'ViltImageProcessor' object has no attribute 'pad_and_create_pixel_mask'.

This comment has been hidden

Hi, you can use .convert("RGB") to make the greyscale image RGB.

Sign up or log in to comment