Error while loading the model

#12
by AL58763 - opened

I'm coming across an error while trying to load the idefics-9b-instruct model. Even after downloading the model weights and loading them using model = IdeficsForVisionText2Text.from_pretrained(model_path, torch_dtype=torch.bfloat16).to("cuda:1"), I'm getting this error:

OSError: Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory model_dir/idefics_9b/.

I haven't changed the names of the weights after downloading, so I'm not sure what's causing this error.

AL58763 changed discussion title from Is this model still maintained? I'm seeing no replies on the questions to Error while loading the model

I got it working by combining the weights into a single pytorch_model.bin file. I'm now trying to run the model on a V100 GPU but I'm getting a CUDA out-of-memory error. I have two V100 GPUs available and I'm loading the model on a single GPU. Is it possible to split the weights across the two GPUs?

Hi @AL58763
Can you ls the contents of your model_dir/idefics_9b/ folder? I am assuming that model_path = model_dir/idefics_9b/.

Thanks for the reply. Yes, model_path = model_dir/idefics_9b/. Here are the contents of the folder:

config.json
model-00002-of-00002.safetensors
pytorch_model-00002-of-00002.bin
tokenizer.json
generation_config.json
preprocessor_config.json
pytorch_model.bin
tokenizer.model
model-00001-of-00002.safetensors
pytorch_model-00001-of-00002.bin
tokenizer_config.json
urls.txt

As I said above, I combined both PyTorch .bin files into a single pytorch_model.bin.

AL58763 changed discussion status to closed
AL58763 changed discussion status to open

@VictorSanh I can provide local paths of images in the prompts as well, right? I provided a local image and then checked inputs['pixel_values'], and it's just black pixels.

I think you are missing some files in your folder. For instance, model.safetensors.index.json is the file that maps each weight to the shard that stores it. Without this file, the loading logic does not know where to get each weight (i.e. from model-00002-of-00002.safetensors or model-00001-of-00002.safetensors), which is why you had to create a merged pytorch_model.bin. There might be other files missing, but that's the first one that came to mind.
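A quick way to check a local folder for this problem is sketched below. It assumes the usual layout of a sharded safetensors checkpoint, where the index file has a "weight_map" entry mapping parameter names to shard file names; the function name missing_checkpoint_files is made up for illustration and is not part of transformers.

```python
import json
from pathlib import Path

INDEX_NAME = "model.safetensors.index.json"


def missing_checkpoint_files(model_dir):
    """Return the sharded-checkpoint files that are absent from model_dir.

    Hypothetical helper for illustration; not part of any library.
    """
    folder = Path(model_dir)
    index_path = folder / INDEX_NAME
    if not index_path.is_file():
        # Without the index, the loader cannot tell which shard holds which weight.
        return [INDEX_NAME]

    with index_path.open() as f:
        index = json.load(f)

    # "weight_map" maps each parameter name to the shard file that stores it.
    shard_names = sorted(set(index["weight_map"].values()))
    return [name for name in shard_names if not (folder / name).is_file()]
```

Calling missing_checkpoint_files("model_dir/idefics_9b/") on the folder listed above would be expected to report the index file itself, since it does not appear in the listing.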

@VictorSanh I can provide local paths of images in the prompts as well, right?

Off the top of my head, I think you need to load the image yourself (into a PIL object) if it's local.
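A minimal sketch of that idea, assuming the prompt is a list that mixes text strings, image URLs, and local image paths, and that the processor accepts PIL images in place of the paths. The helper names (resolve_local_images, _open_with_pil) and the suffix list are hypothetical; only Image.open and convert come from Pillow.

```python
from pathlib import Path

# Assumed set of image file extensions; adjust as needed.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".bmp", ".gif", ".webp"}


def _open_with_pil(path):
    # Imported lazily so a different loader can be passed in instead.
    from PIL import Image
    return Image.open(path).convert("RGB")


def _is_local_image(item):
    """True if item is a string naming an existing local image file."""
    if not isinstance(item, str):
        return False
    path = Path(item)
    if path.suffix.lower() not in IMAGE_SUFFIXES:
        return False
    try:
        return path.is_file()
    except OSError:
        # Some strings cannot be queried as paths; treat them as plain text.
        return False


def resolve_local_images(prompt_items, open_image=_open_with_pil):
    """Replace local image paths with loaded images; leave text and URLs as-is."""
    return [open_image(item) if _is_local_image(item) else item
            for item in prompt_items]
```

The resolved list can then be passed to the processor instead of the raw paths. Checking inputs['pixel_values'] afterwards, as done earlier in this thread, is a reasonable way to confirm that the image was actually read.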

Thanks @VictorSanh it worked!

Great! I'll close this discussion. Feel free to re-open it (or create another one) if you have other questions!

VictorSanh changed discussion status to closed
