If I wish to access the weights of the quantized model, do we have a provision to access it. For a llama2 model we would consequently use pth files or bin file to extract the model level information. Can we access the same with this quantized model.
· Sign up or log in to comment