Thanks for the ggml model - Docker integration
I've integrated it into a Docker build for an "Open Llama-in-a-box" REST API server. Please keep the open-llama-3b-q5_1.bin file available for download. Full details here:
I will probably update the same repo when newer checkpoints are released; would that be alright?
The conversion itself is nothing complex. I will add the Makefile here someday.
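Roughly, the steps look like this (a sketch, not the eventual Makefile; the script names and paths assume a stock llama.cpp checkout, and the model directory is a placeholder):

```shell
# Sketch only -- assumes a stock llama.cpp checkout and that the original
# checkpoint has been fetched into models/open_llama_3b (placeholder path).

# 1. Convert the original checkpoint to ggml f16:
python3 convert.py models/open_llama_3b --outtype f16 --outfile open-llama-3b-f16.bin

# 2. Quantize down to q5_1:
./quantize open-llama-3b-f16.bin open-llama-3b-q5_1.bin q5_1
```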
I could maybe leave a branch pinned to this specific checkpoint.
Sorry, didn't see your message.
@abetlen accepted my PR, so you can find the latest stable "open llama in-a-box" here:
https://github.com/abetlen/llama-cpp-python/tree/main/docker/open_llama
I just smoke-tested your updated 3B repo, and it appears to work with the above code without any changes. Once you have the 7B model converted, it's a single-line change in build.sh to download and package that instead.
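To illustrate, the swap would look something like this (a hypothetical sketch; the variable name and hosting URL are assumptions, not the actual contents of build.sh):

```shell
# Hypothetical sketch of the one-line change in build.sh.
# MODEL_FILE and the hosting URL are assumptions, not the real script contents.
MODEL_FILE="open-llama-3b-q5_1.bin"   # change this to the 7B file name once it exists
MODEL_URL="https://example.com/models/${MODEL_FILE}"  # placeholder host
echo "would fetch: ${MODEL_URL}"
```

Everything downstream of the download keys off the file name, so nothing else in the build should need to change.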