Update README.md
Browse files
README.md
CHANGED
@@ -37,7 +37,6 @@ GGML files are for CPU + GPU inference using [llama.cpp](https://github.com/gger
|
|
37 |
|
38 |
## Repositories available
|
39 |
|
40 |
-
* [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/orca_mini_3B-GPTQ)
|
41 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/orca_mini_3B-GGML)
|
42 |
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/psmathur/orca_mini_3b)
|
43 |
|
|
|
37 |
|
38 |
## Repositories available
|
39 |
|
|
|
40 |
* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/orca_mini_3B-GGML)
|
41 |
* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/psmathur/orca_mini_3b)
|
42 |
|