llama-cpp-wasm   🐇 multithreading wasm32


WebAssembly (Wasm) Build and Bindings for llama.cpp.


This demonstration enables you to run LLM models directly in your browser utilizing JavaScript, WebAssembly, and llama.cpp.


Repository: https://github.com/tangledgroup/llama-cpp-wasm


When you click Run, model will be first downloaded and cached in browser.

Demo