Commit f97bb8a · Parent: b3a4785
Update README.md
README.md
CHANGED
@@ -33,16 +33,6 @@ GGML files are for CPU + GPU inference using [llama.cpp](https://github.com/gger
 
 
 
-## Explanation of the new k-quant methods
-<details>
-<summary>Click to see details</summary>
-
-The new methods available are:
-* GGML_TYPE_Q4_K - "type-1" 4-bit quantization in super-blocks containing 8 blocks, each block having 32 weights. Scales and mins are quantized with 6 bits. This ends up using 4.5 bpw.
-
-</details>
-<!-- compatibility_ggml end -->
-
 ## Provided files
 | Name | Quant method | Bits | Size | Max RAM required | Use case |
 | ---- | ---- | ---- | ---- | ---- | ----- |
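As a sanity check on the section this commit removes, the quoted 4.5 bpw figure for GGML_TYPE_Q4_K can be reproduced from the stated layout. This is a sketch: the diff states only the 8-blocks-of-32-weights structure and the 6-bit scales/mins; the fp16 super-block scale and min are an assumption taken from llama.cpp's `block_q4_K` layout, not from the text above.

```python
# Arithmetic check of the 4.5 bpw claim for GGML_TYPE_Q4_K.
# Stated in the removed README text: 8 blocks of 32 weights per
# super-block, 4-bit values, 6-bit per-block scales and mins.
# Assumed (from llama.cpp's block_q4_K): one fp16 scale and one
# fp16 min per super-block on top of the per-block metadata.
BLOCKS = 8
WEIGHTS_PER_BLOCK = 32
weights = BLOCKS * WEIGHTS_PER_BLOCK        # 256 weights per super-block

quant_bits = weights * 4                    # 4-bit quantized values
scale_bits = BLOCKS * (6 + 6)               # 6-bit scale + 6-bit min per block
super_bits = 2 * 16                         # assumed fp16 super-block scale/min

bpw = (quant_bits + scale_bits + super_bits) / weights
print(bpw)  # -> 4.5
```

Under these assumptions a super-block costs 1024 + 96 + 32 = 1152 bits for 256 weights, which matches the 4.5 bpw stated in the removed section.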