The broad range of quantization options in llama.cpp and its slim design would be a good fit for this project.
I hope to see support for this open-source engine.