
Error code 400 when deploying huggingface bigscience/bloom to SageMaker

#93
by kanikarphan - opened

Using the sample code below:
[screenshot of the sample deployment code, not reproduced]
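The screenshot itself is not recoverable, but the standard SageMaker Hugging Face deployment pattern it most likely shows is sketched below. Everything here is an assumption, not the poster's actual code: the role lookup, instance type, and container versions are illustrative placeholders.

```python
# Sketch of the usual SageMaker Hugging Face deployment flow (assumed, not
# the original screenshot). Requires AWS credentials and the sagemaker SDK.
import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()

# Environment variables telling the inference container which Hub model to load
hub = {
    "HF_MODEL_ID": "bigscience/bloom",
    "HF_TASK": "text-generation",
}

huggingface_model = HuggingFaceModel(
    env=hub,
    role=role,
    transformers_version="4.17.0",  # the latest DLC version at the time (assumed)
    pytorch_version="1.10.2",
    py_version="py38",
)

predictor = huggingface_model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # illustrative; BLOOM-176B needs far more memory
)

predictor.predict({"inputs": "Hello, my name is"})
```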

I'm getting the following error:

{
  "code": 400,
  "type": "InternalServerException",
  "message": "\u0027bloom\u0027"
}

Any ideas on what could be causing this issue?

kanikarphan changed discussion status to closed

@kanikarphan , I'm seeing exactly the same output, how did you overcome the problem?

@yyb53 it's because BLOOM requires transformers version 4.21.0, but the inference containers on offer only support up to version 4.17.0. I ended up not using SageMaker. I went with a serverless approach and leveraged our custom container, which has transformers 4.21.0 installed. Even though I got the BLOOM model working, it is so large it's practically unusable: running any inference is unbearably slow.
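The bare `"'bloom'"` message in the 400 response is consistent with this version mismatch: transformers releases before 4.21.0 have no `"bloom"` entry in the model-type registry that `AutoConfig` consults, so loading the model raises a `KeyError`, and `str()` of a `KeyError` is just the quoted key. A minimal sketch of that mechanism (the registry contents here are an illustrative subset, not the real mapping):

```python
# Illustrative subset of a model-type registry like the one transformers'
# AutoConfig uses; versions before 4.21.0 had no "bloom" entry.
CONFIG_MAPPING = {
    "bert": "BertConfig",
    "gpt2": "GPT2Config",
}

def lookup(model_type):
    # An unknown model type raises KeyError; str(KeyError) is the quoted key,
    # which is all that surfaces in the error response body.
    try:
        return CONFIG_MAPPING[model_type]
    except KeyError as err:
        return str(err)

print(lookup("bloom"))  # prints 'bloom' (with the quotes), matching the 400 message
```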
