Can you provide the training code for this?

by Aravindan - opened Sep 6, 2022

Discussion

Aravindan

Sep 6, 2022

We are looking for training code for our use.

nielsr

Dec 15, 2022

You can take a look at this demo notebook, illustrating fine-tuning this model on custom data: https://github.com/NielsRogge/Transformers-Tutorials/blob/master/ViLT/Fine_tuning_ViLT_for_VQA.ipynb

manasj

Jul 20, 2023

This code is showing error when running the line
encoding = processor.feature_extractor.pad_and_create_pixel_mask(pixel_values, return_tensors="pt")
'ViltImageProcessor' object has no attribute 'pad_and_create_pixel_mask'.

nielsr

Jul 20, 2023

Hi,

The method is now called pad: https://github.com/huggingface/transformers/blob/35c04596f8938370dd5a2930fb724781f8ea35b0/src/transformers/models/vilt/image_processing_vilt.py#L296. Apologies for this, will update my notebook

roolaml

May 13, 2024

This comment has been hidden

nielsr

May 13, 2024

Hi, you can use .convert("RGB") to make the greyscale image RGB.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment