kaiokendev
/

SuperCOT-LoRA

Model card Files Files and versions Community

kaiokendev commited on Apr 24, 2023

Commit

fa51f55

•

1 Parent(s): 0373904

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -27,12 +27,15 @@ It uses a mixture of the following datasets:
 - GGML 30B 4-bit: [https://huggingface.co/gozfarb/llama-30b-supercot-ggml](https://huggingface.co/gozfarb/llama-30b-supercot-ggml)
 - 30B (unquantized): [https://huggingface.co/ausboss/llama-30b-supercot](https://huggingface.co/ausboss/llama-30b-supercot)
 - 30B 4-bit 128g CUDA: [https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda)
 #### 13B
 - 13B (unquantized): [https://huggingface.co/ausboss/llama-13b-supercot](https://huggingface.co/ausboss/llama-13b-supercot)
 - 13B 4-bit 128g CUDA: [https://huggingface.co/ausboss/llama-13b-supercot-4bit-128g](https://huggingface.co/ausboss/llama-13b-supercot-4bit-128g)
 - 13B 4-bit 128g TRITON: [https://huggingface.co/TheYuriLover/llama-13b-SuperCOT-4bit-TRITON](https://huggingface.co/TheYuriLover/llama-13b-SuperCOT-4bit-TRITON)
-- GGML 13B 4-bit: [https://huggingface.co/gozfarb/llama-13b-supercot-ggml](https://huggingface.co/gozfarb/llama-13b-supercot-ggml)
 (Thanks to all the awesome anons with supercomputers)

 - GGML 30B 4-bit: [https://huggingface.co/gozfarb/llama-30b-supercot-ggml](https://huggingface.co/gozfarb/llama-30b-supercot-ggml)
 - 30B (unquantized): [https://huggingface.co/ausboss/llama-30b-supercot](https://huggingface.co/ausboss/llama-30b-supercot)
 - 30B 4-bit 128g CUDA: [https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-4bit-128g-cuda)
+- 30B 4-bit 128g TRITON: N/A
+- 30B 4-bit CUDA (no groupsize, better VRAM): [https://huggingface.co/tsumeone/llama-30b-supercot-4bit-cuda](https://huggingface.co/tsumeone/llama-30b-supercot-4bit-cuda)
 #### 13B
+- GGML 13B 4-bit: [https://huggingface.co/gozfarb/llama-13b-supercot-ggml](https://huggingface.co/gozfarb/llama-13b-supercot-ggml)
 - 13B (unquantized): [https://huggingface.co/ausboss/llama-13b-supercot](https://huggingface.co/ausboss/llama-13b-supercot)
 - 13B 4-bit 128g CUDA: [https://huggingface.co/ausboss/llama-13b-supercot-4bit-128g](https://huggingface.co/ausboss/llama-13b-supercot-4bit-128g)
 - 13B 4-bit 128g TRITON: [https://huggingface.co/TheYuriLover/llama-13b-SuperCOT-4bit-TRITON](https://huggingface.co/TheYuriLover/llama-13b-SuperCOT-4bit-TRITON)
+- 13B 4-bit CUDA (no groupsize, better VRAM): N/A
 (Thanks to all the awesome anons with supercomputers)