[Code Help] quick start code snippet taking too long to generate a response

#194

by ppoptart - opened Jul 3, 2024

Jul 3, 2024

Hi, I have tried running the code below on both my local VS code and a google Colab but it is taking very long to run/never completes generating. Can someone help me fix this, or is this normal behaviour?

import transformers
import torch
access_token = 'MY_TOKEN'

model_id = "meta-llama/Meta-Llama-3-8B"

pipeline = transformers.pipeline(
"text-generation", model=model_id, model_kwargs={"torch_dtype": torch.bfloat16}, device_map="auto",token=access_token
)
pipeline("Hey how are you doing today?")

rain7996

Jul 4, 2024

I have the same problem. My platform is a cloud virtual machine with 32GB memory, 16 Core AMD CPU, no GPU. It takes about 30-60 minutes to generate the answer. If anyone has the optimization method, discuss with me plz. Thanks a lot.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment