🚩 No files available

by alvarobartt - opened Feb 22, 2024

Discussion

alvarobartt

Feb 22, 2024

Hi here @mlabonne !

Nice and fast GGUF quantized weights of Google's Gemma models, just flagging that there are no files available here.

mlabonne

Owner Feb 22, 2024

Ah maybe don't though, I'm just waiting for llama.cpp to fix the inference. :( I can upload the files but then people will flag it because it doesn't work haha

alvarobartt

Feb 22, 2024

Oh fair! Indeed I'm using it now and seems to be working fine so far with llama-cpp-python, I'm filling a PR to add the gemma formatting

alvarobartt changed discussion status to closed Feb 22, 2024

alvarobartt

Feb 22, 2024

Pasting it here for reference in case that's useful to you @mlabonne https://github.com/abetlen/llama-cpp-python/pull/1210

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment