I wan to know the mininum VRAM it require?
OpenThaiGPT 1.5 14B is based on Qwen 2.5 14B, which requires at least 29.6 GB of VRAM as suggested by doc (Reference) However, with the quantized version, you can reduce this requirement at the cost of throughput. Someone has done a benchmark here.
ยท Sign up or log in to comment