3.78 TB
228 files
Updated 25 days ago
Name                                      Size
IQ1_M
IQ1_S
IQ2_M
IQ2_S
IQ2_XS
IQ2_XXS
IQ3_M
IQ3_S
IQ3_XS
IQ3_XXS
IQ4_NL
IQ4_XS
Q2_K
Q2_K_S
Q3_K_L
Q3_K_M
Q3_K_S
Q4_K
Q5_K
Q6_K
Q8_0
.gitattributes                            1.56 kB
README.md                                 5.45 kB
config.json                               29 Bytes
grok-1-IQ3_XS-split-00001-of-00009.gguf   17.5 GB
grok-1-IQ3_XS-split-00002-of-00009.gguf   15.3 GB
grok-1-IQ3_XS-split-00003-of-00009.gguf   15.2 GB
grok-1-IQ3_XS-split-00004-of-00009.gguf   15.6 GB
grok-1-IQ3_XS-split-00005-of-00009.gguf   15.4 GB
grok-1-IQ3_XS-split-00006-of-00009.gguf   15.2 GB
grok-1-IQ3_XS-split-00007-of-00009.gguf   15.2 GB
grok-1-IQ3_XS-split-00008-of-00009.gguf   16 GB
grok-1-IQ3_XS-split-00009-of-00009.gguf   4.18 GB
grok-1-Q2_K-split-00001-of-00009.gguf     14.9 GB
grok-1-Q2_K-split-00002-of-00009.gguf     13.8 GB
grok-1-Q2_K-split-00003-of-00009.gguf     13.8 GB
grok-1-Q2_K-split-00004-of-00009.gguf     14.2 GB
grok-1-Q2_K-split-00005-of-00009.gguf     14 GB
grok-1-Q2_K-split-00006-of-00009.gguf     13.8 GB
grok-1-Q2_K-split-00007-of-00009.gguf     13.8 GB
grok-1-Q2_K-split-00008-of-00009.gguf     14.1 GB
grok-1-Q2_K-split-00009-of-00009.gguf     3.6 GB
grok-1-Q4_K-split-00001-of-00009.gguf     25.9 GB
grok-1-Q4_K-split-00002-of-00009.gguf     22.3 GB
grok-1-Q4_K-split-00003-of-00009.gguf     22.6 GB
grok-1-Q4_K-split-00004-of-00009.gguf     22.8 GB
grok-1-Q4_K-split-00005-of-00009.gguf     22.9 GB
grok-1-Q4_K-split-00006-of-00009.gguf     22.4 GB
grok-1-Q4_K-split-00007-of-00009.gguf     22.4 GB
grok-1-Q4_K-split-00008-of-00009.gguf     24.7 GB
grok-1-Q4_K-split-00009-of-00009.gguf     6.39 GB
grok-1-Q6_K-split-00001-of-00009.gguf     32.6 GB
grok-1-Q6_K-split-00002-of-00009.gguf     31 GB
grok-1-Q6_K-split-00003-of-00009.gguf     31 GB
grok-1-Q6_K-split-00004-of-00009.gguf     31.9 GB
grok-1-Q6_K-split-00005-of-00009.gguf     31.5 GB
grok-1-Q6_K-split-00006-of-00009.gguf     31 GB
grok-1-Q6_K-split-00007-of-00009.gguf     31 GB
grok-1-Q6_K-split-00008-of-00009.gguf     31.7 GB
grok-1-Q6_K-split-00009-of-00009.gguf     8.08 GB
README.md

Grok-1 GGUF Quantizations

This repository contains unofficial GGUF quantizations of Grok-1, compatible with llama.cpp as of PR "Add grok-1 support" (#6204).

Updates

Native Split Support in llama.cpp

  • The splits have been updated to use the improvements from the PR "llama_model_loader: support multiple split/shard GGUFs". As a result, manual merging with gguf-split is no longer required.

    Just download all splits and run llama.cpp with the first split as the model file, as you would have before. It detects the other splits and loads them as well.
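Since llama.cpp is only pointed at the first split, it can help to confirm that every part is present before starting it. The following is a minimal POSIX shell sketch, assuming the grok-1-<quant>-split-NNNNN-of-NNNNN.gguf naming seen in this repository. The helper names split_names and check_splits are hypothetical and not part of llama.cpp.

```shell
# Print the expected split file names for one quant (hypothetical helper).
# Usage: split_names QUANT TOTAL, e.g. split_names IQ3_XS 9
split_names() {
    quant="$1"
    total="$2"
    i=1
    while [ "$i" -le "$total" ]; do
        # Zero-padded to five digits, matching the file names in this repo.
        printf 'grok-1-%s-split-%05d-of-%05d.gguf\n' "$quant" "$i" "$total"
        i=$((i + 1))
    done
}

# Report every split missing from DIR; returns non-zero if any is missing.
# Usage: check_splits DIR QUANT TOTAL
check_splits() {
    dir="$1"
    missing=0
    for name in $(split_names "$2" "$3"); do
        if [ ! -f "$dir/$name" ]; then
            echo "missing: $name"
            missing=1
        fi
    done
    return "$missing"
}
```

For example, check_splits models IQ3_XS 9 prints nothing and succeeds only when all nine parts are in the models directory.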

Direct Split Download from Hugging Face using llama.cpp

server \
    --hf-repo Arki05/Grok-1-GGUF \
    --hf-file grok-1-IQ3_XS-split-00001-of-00009.gguf \
    --model models/grok-1-IQ3_XS-split-00001-of-00009.gguf \
    -ngl 999

And that is very cool (@phymbert)
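If you would rather fetch the splits yourself, the download URLs can be generated from the file names. This is a rough sketch, assuming the splits sit at the root of the main branch (as the file listing suggests) and that the usual Hugging Face resolve/main URL layout applies. split_urls is a hypothetical helper name.

```shell
# Print one download URL per split of a quant (hypothetical helper).
# Assumption: files are at the root of the "main" branch of the repo.
# Usage: split_urls REPO QUANT TOTAL
split_urls() {
    repo="$1"
    quant="$2"
    total="$3"
    i=1
    while [ "$i" -le "$total" ]; do
        printf 'https://huggingface.co/%s/resolve/main/grok-1-%s-split-%05d-of-%05d.gguf\n' \
            "$repo" "$quant" "$i" "$total"
        i=$((i + 1))
    done
}
```

The output can be piped to a downloader that reads URLs from standard input, for example split_urls Arki05/Grok-1-GGUF IQ3_XS 9 | wget -c -P models -i -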

Available Quantizations

The following Quantizations are currently available for download:

Quant   Split Files                                                             Size
Q2_K    1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9  112.4 GB
IQ3_XS  1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9  125.4 GB
Q4_K    1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9  186.0 GB
Q6_K    1-of-9, 2-of-9, 3-of-9, 4-of-9, 5-of-9, 6-of-9, 7-of-9, 8-of-9, 9-of-9  259.8 GB
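Given these sizes, it may be worth checking free disk space before downloading a quant. The sketch below is one way to do that in POSIX shell. It treats 1 GB as 10^9 bytes, which is an assumption about how the sizes above are measured, and has_space is a hypothetical helper name.

```shell
# Succeed if the filesystem holding DIR has at least NEED_GB free.
# Assumption: 1 GB = 10^9 bytes. Hypothetical helper, not part of llama.cpp.
# Usage: has_space DIR NEED_GB, e.g. has_space models 125.4
has_space() {
    dir="$1"
    need_gb="$2"
    # df -Pk prints one line per filesystem; column 4 is free space in KiB.
    avail_kb=$(df -Pk "$dir" | awk 'NR == 2 { print $4 }')
    # awk handles the fractional size; exit status 0 means enough space.
    awk -v avail="$avail_kb" -v need="$need_gb" \
        'BEGIN { exit !(avail * 1024 >= need * 1000000000) }'
}
```

For example, has_space models 125.4 || echo "not enough space for IQ3_XS" warns when the target directory is too small.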

I would recommend the IQ3_XS version for now.

More Quantizations will be uploaded soon. All current Quants were created without an importance matrix.

Total size: 3.78 TB
Files: 228
Last updated: Jun 6
