During my inference test, GPU memory usage went up to 70 GB.
Same here. Why does the int4 model use up to 70 GB of memory?