Running on GPU via HF transformers

by sudhir2016 - opened

The model runs out of memory on the free tier of Google Colab.

As suggested by Eric Alcaide, I tried quantization with Hugging Face Quanto, and it works fine now. Thanks to @dacorvo for the excellent blog post on Quanto.
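For anyone hitting the same limit, here is a rough sketch of what that can look like. It is not the exact code used in this thread: the `AutoModelForCausalLM` class, the int8 weight setting, and the helper names `estimate_weight_gib` and `load_quantized` are assumptions for illustration, and it assumes the `optimum-quanto` package is installed. The first helper only does the arithmetic showing why fewer bits per weight shrink the memory footprint.

```python
def estimate_weight_gib(num_params: int, bits_per_weight: int) -> float:
    """Approximate memory taken by the weights alone, in GiB.

    Ignores activations, KV cache, and framework overhead.
    """
    return num_params * bits_per_weight / 8 / 1024**3


def load_quantized(model_id: str, device: str = "cuda"):
    """Load a model, quantize its weights to int8 with Quanto, move it to `device`.

    Hypothetical helper: the model class and settings are assumptions,
    not taken from this discussion.
    """
    # Imported here so the estimate helper above works without a GPU stack.
    import torch
    from transformers import AutoModelForCausalLM
    from optimum.quanto import quantize, freeze, qint8

    # Load in half precision on CPU first.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16)

    # Mark the weights for int8 quantization, then freeze to
    # replace the float weights with their quantized versions.
    quantize(model, weights=qint8)
    freeze(model)

    return model.to(device)


if __name__ == "__main__":
    # Hypothetical 7-billion-parameter model, for illustration only.
    n = 7_000_000_000
    for bits in (16, 8):
        print(f"{bits}-bit weights: {estimate_weight_gib(n, bits):.2f} GiB")

    # Usage (needs a GPU runtime; replace the placeholder with a real model id):
    # model = load_quantized("<model-id>")
```

For the hypothetical 7-billion-parameter model, the weights alone take about 13 GiB at 16 bits and about 6.5 GiB at 8 bits, which is the kind of saving that can make a model fit on a small GPU.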

sudhir2016 changed discussion status to closed
