How to reduce GPU memory?

#2
by ulrika-cyl - opened

image.png

during my infer test, the gpu memory usage up to 70G

me too, why the memory usage of int4 model is up to 70G?

Sign up or log in to comment