--- license: wtfpl --- 1. Go to [llama.cpp](https://github.com/ggerganov/llama.cpp/releases/) and download one of those folders ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/2frhHG2gJmzgNRWiPhdIO.png) 2. If you're about to use CUDA - check the version your card supports(12.2 for any RTX) and download one of those folders ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/tampPIs0mt6J86VHTogEf.png) 3. Unpack everything in one folder and rename it to "LlamaCPP", put this folder in the same folder where main.py/main.exe file is ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/II3FHj0WxzT_3Zi60Us5u.png) 4. Launch main.py/main.exe file ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64f5e51289c121cb864ba464/sTM3GUVMucM_AnIk8iG7H.png)