Amazon SageMaker deploy

by ghpkishore - opened Jun 30, 2022

Jun 30, 2022

I clicked on the Amazon SageMaker deploy option and followed the steps given. However, it throws an error. Should I change the "instance type" parameter?

The inference widget doesn't run, and SageMaker doesn't work either, so I do not know how to use this model. Any help is greatly appreciated.

ghpkishore

Jun 30, 2022

I realised that there is no way to actually run these models on a system without access to A100 GPUs. Requesting that instance on Amazon costs approx. USD 32. Therefore, unless you are at a big company or in academia, this model cannot be used.

julien-c

Jul 1, 2022

pinging @philschmid just for visibility

philschmid

Google org Jul 1, 2022

Hello @ghpkishore ,

It should be possible to run the model with Large Model Loading on Amazon SageMaker. However, there is not yet a container available with the supported transformers version, meaning you would need to create a custom inference.py + requirements.txt to deploy to SageMaker (example).

As for the instance type, I am not sure which one is enough. I would try either g5.12xlarge or p3.8xlarge.
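For illustration, a custom inference.py of the kind mentioned above might look roughly like this. It is only a sketch, not the linked example. It assumes the SageMaker Hugging Face inference toolkit's `model_fn` / `predict_fn` override hooks, a seq2seq checkpoint (swap the `Auto*` class for other architectures), half precision, and a requirements.txt that installs a recent enough `transformers` plus `accelerate` (needed for `device_map="auto"`).

```python
# code/inference.py -- hypothetical sketch; adjust model class and dtype to your checkpoint


def model_fn(model_dir):
    """Load the model and tokenizer from the unpacked model archive."""
    # Imported lazily so this module can be imported without the GPU stack installed.
    import torch
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_dir)
    model = AutoModelForSeq2SeqLM.from_pretrained(
        model_dir,
        device_map="auto",          # assumption: let accelerate spread layers across the GPUs
        torch_dtype=torch.float16,  # assumption: half precision to reduce memory use
    )
    return model, tokenizer


def predict_fn(data, model_and_tokenizer):
    """Run generation for a request shaped like {"inputs": ..., "parameters": {...}}."""
    model, tokenizer = model_and_tokenizer
    inputs = data.pop("inputs", data)
    parameters = data.pop("parameters", None) or {}

    encoded = tokenizer(inputs, return_tensors="pt", padding=True).to(model.device)
    output_ids = model.generate(**encoded, **parameters)
    texts = tokenizer.batch_decode(output_ids, skip_special_tokens=True)
    return [{"generated_text": text} for text in texts]
```

The file would sit in a `code/` directory next to requirements.txt inside the model archive. Generation options such as `max_new_tokens` are passed through from the request's `parameters` field unchanged.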
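To get a rough feel for whether either instance is large enough, a back-of-the-envelope check can help. The sketch below counts only the memory for the weights (2 bytes per parameter in fp16) against the total GPU memory of each instance (4x A10G with 24 GiB each on g5.12xlarge, 4x V100 with 16 GiB each on p3.8xlarge). The 20-billion parameter count and the 80% headroom factor are placeholder assumptions, so substitute the real figure for your model. Activations and generation caches need additional memory on top.

```python
# Rough estimate only: weights alone, ignoring activations and generation caches.
GPU_MEMORY_GIB = {
    "g5.12xlarge": 4 * 24,  # 4x NVIDIA A10G, 24 GiB each
    "p3.8xlarge": 4 * 16,   # 4x NVIDIA V100, 16 GiB each
}


def weights_gib(num_params, bytes_per_param=2):
    """Memory needed for the weights, in GiB (2 bytes per parameter = fp16)."""
    return num_params * bytes_per_param / 2**30


def fits(instance, num_params, bytes_per_param=2, headroom=0.8):
    """True if the weights fit in `headroom` (an assumed safety margin) of total GPU memory."""
    return weights_gib(num_params, bytes_per_param) <= headroom * GPU_MEMORY_GIB[instance]


# Hypothetical parameter count; replace with the actual size of the model.
NUM_PARAMS = 20e9
for name in GPU_MEMORY_GIB:
    print(name, fits(name, NUM_PARAMS))
```

Passing the check does not guarantee the deployment will work. It only rules out instances that are clearly too small.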
