Running on GPU via HF transformers

by sudhir2016 - opened Mar 18

Discussion

sudhir2016

Mar 18

The model runs out of memory when loaded on a free-tier Google Colab GPU.

sudhir2016

Mar 20

As suggested by Eric Alcaide, I tried quantizing the model with Hugging Face Quanto, and it works fine now. Thanks to @dacorvo for the excellent blog post on Quanto.
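For anyone landing here with the same problem, here is a minimal sketch of what this kind of fix can look like. It is not the exact code used in this thread. It assumes the model loads through `AutoModelForCausalLM`, that `optimum-quanto` and `accelerate` are installed alongside `transformers`, and that int8 weights are enough to fit. The helper names `estimated_weight_gib` and `load_quantized` are made up for illustration.

```python
# Bits per weight. "float16" is the unquantized baseline for comparison only;
# the other names are weight dtypes that transformers' QuantoConfig accepts.
BITS_PER_WEIGHT = {"float16": 16, "float8": 8, "int8": 8, "int4": 4, "int2": 2}


def estimated_weight_gib(num_params: int, weights: str) -> float:
    """Rough size of the weights alone, in GiB.

    Ignores activations, the KV cache, and quantization scales, so the real
    footprint will be somewhat higher.
    """
    return num_params * BITS_PER_WEIGHT[weights] / 8 / 2**30


def load_quantized(model_id: str, weights: str = "int8"):
    """Load a causal LM with Quanto-quantized weights on the GPU.

    Assumption: the checkpoint is compatible with AutoModelForCausalLM.
    """
    # Imported lazily so the estimate above works without these packages.
    from transformers import AutoModelForCausalLM, QuantoConfig

    config = QuantoConfig(weights=weights)
    return AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=config,
        device_map="cuda",  # placing weights this way requires accelerate
    )
```

As a sanity check before loading, `estimated_weight_gib(7_000_000_000, "float16")` comes to about 13.0 GiB for a hypothetical 7B-parameter model, versus about 6.5 GiB with `"int8"`. Compare that against the GPU memory your Colab runtime actually reports, then call `load_quantized` with the model's repo id.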

sudhir2016 changed discussion status to closed Mar 20
