Not able to run hello world example, bigcode/starcoder is not a valid model identifier

#11

by rameshn - opened May 5, 2023

May 5, 2023

Hi,
I tried running the following, and get an error. Is the model path correct ?
Thanks.

--
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
model_ckpt = "bigcode/starcoder"
#model_ckpt = "https://huggingface.co/bigcode/starcoder"

model = AutoModelForCausalLM.from_pretrained(model_ckpt)
tokenizer = AutoTokenizer.from_pretrained(model_ckpt)

pipe = pipeline("text-generation", model=model, tokenizer=tokenizer, device="cuda:2")
print( pipe("def hello():") )

--
...
OSError: bigcode/starcoder is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'

br3w0r

May 5, 2023

You have to login using huggingface_hub. Look at the Quickstart guide

ArmelR

BigCode org May 5, 2023

Hi. You should go to hf.co/bigcode/starcoder and fill accept the agreement if you want to be able to use the model. You would also want to connect using huggingface-cli.

tarodnet

May 5, 2023

Maybe you also need some requirements:

https://github.com/bigcode-project/starcoder

cactusthecoder8

May 6, 2023

Set your personal access token to the use_auth_token parameter when you call from_pretrained.

ViPo

May 6, 2023

I have same issue. Tried to login using huggingface_hub, install all requirements, only @cactusthecoder8 recomendation helped, to use use_auth_token parameter to download tokenizer, but then running fails on the next line
model = AutoModelForCausalLM.from_pretrained(checkpoint, trust_remote_code=True).to(device)
With error:

Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'. ```

loubnabnl

BigCode org May 6, 2023

Can you try adding use_auth_token to model loading too (btw you don't need trust_remote_code=True). Overall if you accept the agreement on the model page and follow these steps it should work (assuming you have enough memory):

!pip install transformers==4.28.1
!huggingface-cli login

from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigcode/starcoder")
model = AutoModelForCausalLM.from_pretrained("bigcode/starcoder")

tarodnet

May 6, 2023

Can you try adding use_auth_token to model loading too (btw you don't need trust_remote_code=True). Overall if you accept the agreement on the model page and follow these steps it should work (assuming you have enough memory):
!pip install transformers==4.28.1
!huggingface-cli login
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bigcode/starcoder")
model = AutoModelForCausalLM.from_pretrained("bigcode/starcoder")

What is the recommended memory?

cactusthecoder8

May 6, 2023

•

edited May 14, 2023

What is the recommended memory?

RAM: There are 7 checkpoints with approximately 10G per shard.
GPU memory: definitely > 16 G but 32G is enough

tarodnet

May 6, 2023

•

edited May 7, 2023

What is the recommended memory?

RAM: There are 7 checkpoints with approximately 10G per shard.
GPU memory: definitely > 16 G but 24G is enough

Oh :( I suppose that is the reason I'm getting a killed application when I try to run the example... thanks mate!

rameshn

May 10, 2023

Summarizing the suggestions that fixed the problem for me: (these are in addition to pip install of requirements.txt)
a) accept the license agreement on https://huggingface.co/bigcode/starcoder
b) get access token for starcoder from https://huggingface.co/settings/tokens
c) run 'huggingface-cli login' --> use token provided above

breakenknife

May 30, 2023

•

edited May 30, 2023

@rameshn how to accept the license agreement ?

Summarizing the suggestions that fixed the problem for me: (these are in addition to pip install of requirements.txt)
a) accept the license agreement on https://huggingface.co/bigcode/starcoder
b) get access token for starcoder from https://huggingface.co/settings/tokens
c) run 'huggingface-cli login' --> use token provided above

kingzzm

Jun 25, 2023

@rameshn how to accept the license agreement ?

arita89

Jul 4, 2023

What is the recommended memory?

RAM: There are 7 checkpoints with approximately 10G per shard.
GPU memory: definitely > 16 G but 32G is enough

wait,me stupid, do i need 10G or 70G of RAM?

thanhnew2001

Jul 24, 2023

70GB for storage

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment