Use quantized version with Python?

by acraber - opened

I want to use the quantized version with ctransformers, but I'm not sure what to put for the model type. Also, is there any way to do batch processing?

acraber changed discussion status to closed
