from_pretrained("google/flan-t5-xxl") does not work

#15
by Tebmer - opened

When I run

from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-xxl").to("cuda")

it fails with

404 Client Error: Not Found for url: https://huggingface.co/google/flan-t5-xxl/resolve/main/pytorch_model.bin

How can I solve it? Thanks!

Hey @Tebmer! What is your transformers version?

I'm asking because the flan-t5-xxl checkpoint is sharded into multiple weight files, so there is no single pytorch_model.bin in the repo (hence the 404); you need a transformers version that supports loading sharded checkpoints.

I would recommend updating your transformers version to the latest one and trying again.
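For example, a quick way to check what you have installed (upgrading via pip is an assumption about how you installed it):

import transformers

# Sharded checkpoint loading needs a fairly recent release.
print(transformers.__version__)
# If it is old, upgrade with: pip install -U transformers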

Yeah, that solved my problem. Thanks a lot!

Tebmer changed discussion status to closed

As a side note, flan-t5-xxl is quite large, so I would advise loading it either in half-precision or in 8-bit:

# pip install accelerate bitsandbytes
from transformers import T5ForConditionalGeneration

# load_in_8bit quantizes the weights via bitsandbytes; device_map="auto" spreads layers across available devices
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-xxl", device_map="auto", load_in_8bit=True)
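And a minimal sketch of the half-precision option, with a quick generation check (the example prompt and the "cuda" placement are assumptions about your setup; the tokenizer also needs sentencepiece installed):

import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

# fp16 halves the memory footprint (roughly 22 GB of weights for the 11B model)
model = T5ForConditionalGeneration.from_pretrained("google/flan-t5-xxl", device_map="auto", torch_dtype=torch.float16)

# quick sanity check
tokenizer = T5Tokenizer.from_pretrained("google/flan-t5-xxl")
inputs = tokenizer("Translate English to German: How old are you?", return_tensors="pt").to("cuda")
print(tokenizer.decode(model.generate(**inputs)[0], skip_special_tokens=True))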

Thanks for the reminder. I will try it.
