This change stopped the Runtime error for me, although I am getting a 504 when calling the Llama model.
36b445a (verified), committed by meg (HF staff) 1 day ago