add example how to download splits for 1 quant
#6
by
MaziyarPanahi
- opened
README.md
CHANGED
@@ -35,6 +35,14 @@ quantized_by: MaziyarPanahi
|
|
35 |
## Description
|
36 |
[MaziyarPanahi/WizardLM-2-8x22B-GGUF](https://huggingface.co/MaziyarPanahi/WizardLM-2-8x22B-GGUF) contains GGUF format model files for [microsoft/WizardLM-2-8x22B](https://huggingface.co/microsoft/WizardLM-2-8x22B).
|
37 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
38 |
## Load sharded model
|
39 |
|
40 |
`llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.
|
|
|
35 |
## Description
|
36 |
[MaziyarPanahi/WizardLM-2-8x22B-GGUF](https://huggingface.co/MaziyarPanahi/WizardLM-2-8x22B-GGUF) contains GGUF format model files for [microsoft/WizardLM-2-8x22B](https://huggingface.co/microsoft/WizardLM-2-8x22B).
|
37 |
|
38 |
+
## How to download
|
39 |
+
You can download only the quants you need instead of cloning the entire repository as follows:
|
40 |
+
|
41 |
+
|
42 |
+
```
|
43 |
+
huggingface-cli download MaziyarPanahi/WizardLM-2-8x22B-GGUF --local-dir . --include '*Q2_K*gguf'
|
44 |
+
```
|
45 |
+
|
46 |
## Load sharded model
|
47 |
|
48 |
`llama_load_model_from_file` will detect the number of files and will load additional tensors from the rest of files.
|