I have now tried two quantizations, 8_0 and 6_K; both fail as shown below.
#2 opened by BigDeeper
~/ollama/ollama run phi-3-mini-128k-instruct.Q6_K
Error: llama runner process no longer running: -1
microsoft/Phi-3-mini-4k-instruct-gguf does not cause the same error.
See the relevant GitHub issue here:
https://github.com/ggerganov/llama.cpp/issues/6849
Quants have been updated with the latest release of llama.cpp.
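Since the GGUF files were rebuilt, a model previously created in Ollama from the old quant has to be re-created from the fresh file. A minimal sketch of that workflow is below; the GGUF filename and model name are assumptions based on the command in the report, not taken from the repo.

```shell
# Point a Modelfile at the freshly downloaded GGUF (filename assumed).
cat > Modelfile <<'EOF'
FROM ./phi-3-mini-128k-instruct.Q6_K.gguf
EOF

# The following need a running Ollama install, so they are left commented:
# ollama rm phi-3-mini-128k-instruct.Q6_K      # remove the model built from the old quant
# ollama create phi-3-mini-128k-instruct.Q6_K -f Modelfile
# ollama run phi-3-mini-128k-instruct.Q6_K
```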
munish0838 changed discussion status to closed.