OSError: no file named pytorch_model.bin

#3
by ymoslem - opened

Hello!

I tried this code:

from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("amazon/FalconLite", trust_remote_code=True)

It results in this error:

OSError: amazon/FalconLite does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.

I am not using SageMaker. Just a regular GPU.

Thanks!

Amazon Web Services org

You'll need to pass the model_basename as well -> gptq_model-4bit-128g.safetensors
However, it looks like generation_config.json is missing, so it won't work locally.
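For reference, a minimal sketch of what passing the basename could look like with the AutoGPTQ library (an assumption — the thread doesn't name the loader; the import is guarded so the sketch runs even where auto-gptq isn't installed, and the actual download still needs network access and the missing config fixed):

```python
import importlib.util

# Hypothetical load arguments; "gptq_model-4bit-128g" is the basename of the
# gptq_model-4bit-128g.safetensors file in the amazon/FalconLite repo.
load_kwargs = {
    "model_basename": "gptq_model-4bit-128g",
    "use_safetensors": True,
    "trust_remote_code": True,
}

# Only attempt the load if auto-gptq is actually installed.
if importlib.util.find_spec("auto_gptq") is not None:
    from auto_gptq import AutoGPTQForCausalLM
    model = AutoGPTQForCausalLM.from_quantized("amazon/FalconLite", **load_kwargs)
```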
Error:
OSError: amazon/FalconLite does not appear to have a file named generation_config.json. Checkout 'https://huggingface.co/amazon/FalconLite/main' for available files.

I guess they only made changes in the TGI codebase, so it won't run locally with scaled RoPE because there is no code change from the original Falcon repo. Please correct me if I am wrong; I'm eager to learn.

Does that mean there is no way to fine-tune it (again)?
