Upload gguf-imat-llama-3.py

#30

I advise you to keep this version of the file on a separate branch. The underlying issue, running bf16.gguf on the GPU, will most likely be solved in the future; for now it makes generating imatrix.dat from the bf16 file impossible. That is why we had to create imatrix.dat from f16.gguf while producing the quantized models from bf16.gguf, roughly as in the sketch below.
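A minimal sketch of that workaround, assuming a local llama.cpp build; the paths, model names, and calibration file are illustrative, and binary names vary by llama.cpp version (e.g. `imatrix` vs. `llama-imatrix`, `quantize` vs. `llama-quantize`):

```python
import subprocess

LLAMA_CPP = "./llama.cpp"       # hypothetical path to a llama.cpp checkout/build
CALIB_DATA = "calibration.txt"  # hypothetical calibration text for imatrix

# 1. Convert the HF model twice: f16 for imatrix generation, bf16 for quantization.
subprocess.run(["python", f"{LLAMA_CPP}/convert-hf-to-gguf.py", "model-dir",
                "--outtype", "f16", "--outfile", "model-f16.gguf"], check=True)
subprocess.run(["python", f"{LLAMA_CPP}/convert-hf-to-gguf.py", "model-dir",
                "--outtype", "bf16", "--outfile", "model-bf16.gguf"], check=True)

# 2. Generate imatrix.dat from the f16 GGUF, since bf16 does not run on the
#    GPU here and imatrix generation fails on it.
subprocess.run([f"{LLAMA_CPP}/imatrix", "-m", "model-f16.gguf",
                "-f", CALIB_DATA, "-o", "imatrix.dat"], check=True)

# 3. Quantize from the bf16 GGUF, reusing the imatrix computed on f16.
subprocess.run([f"{LLAMA_CPP}/quantize", "--imatrix", "imatrix.dat",
                "model-bf16.gguf", "model-Q4_K_M.gguf", "Q4_K_M"], check=True)
```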

Thanks! I'll keep it as an additional file.

FantasiaFoundry changed pull request status to merged
