The sample code for usage with Transformers is incorrect.

#45
by endNone - opened

After running the sample code, I encountered the following error:

RuntimeError: Expected one of cpu, cuda, ipu, xpu, mkldnn, opengl, opencl, ideep, hip, ve, fpga, ort, xla, lazy, vulkan, mps, meta, hpu, mtia, privateuseone device type at start of device string: auto

It is necessary to change device='auto' to device_map='auto'.
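For reference, a minimal corrected load might look like this (model ID taken from the reply below; device_map='auto' relies on the accelerate package being installed):

from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-70B-Instruct",
    device_map="auto",  # not device="auto", which raises the RuntimeError above
)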

I have the same error, so I used a pipeline after loading the model.

from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

MODEL_ID = "meta-llama/Meta-Llama-3-70B-Instruct"

tok = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    device_map="auto",  # shard the model across available devices
)

# wrap the preloaded model and tokenizer in a text-generation pipeline
pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tok,
)
...
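A minimal usage sketch, assuming the pipeline above (the prompt and generation settings here are illustrative, not from the original post):

out = pipe("Why is the sky blue?", max_new_tokens=64)
print(out[0]["generated_text"])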
Meta Llama org

Fixed by #31! Thanks for reporting

ArthurZ changed discussion status to closed
