miquella-120b-gguf / README.md
alpindale's picture
Create README.md
2361543 verified
|
raw
history blame
837 Bytes
metadata
tags:
  - merge

Miquella 120B GGUF

GGUF quantized weights for miquella-120b. Contains all quants.

I used Importance Matrices for the quantization, using random data generated from Q8_0 quant of the model for maximum quality.

Due to the limitations of HF's file size, the larger files were split into multiple chunks. Instructions below.

Linux

Example uses Q3_K_L. Replace the names appropriately for your quant of choice.

cat miquella-120b.Q3_K_L.gguf_part_* > miquella-120b.Q3_K_L.gguf && rm miquella-120b.Q3_K_L.gguf_part_*

Windows

Example uses Q3_K_L. Replace the names appropriately for your quant of choice.

COPY /B  miquella-120b.Q3_K_L.gguf_part_aa +  miquella-120b.Q3_K_L.gguf_part_ab  miquella-120b.Q3_K_L.gguf

Then delete the two splits.