ausboss/llama-13b-supercot-4bit-128g


This model is a merge of LLaMA-13B and the SuperCOT LoRA, quantized to 4 bits with a group size of 128.

huggyllama/llama-13b + kaiokendev/SuperCOT-LoRA/13b/gpu/cutoff-2048
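For readers unfamiliar with what merging a LoRA into a base model means, here is a minimal sketch of the arithmetic on a toy matrix. It assumes the standard LoRA formulation, where the merged weight is W + (alpha / rank) * B @ A. The function names and the toy values are hypothetical, and this is not the script used to produce this model.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_a, lora_b, alpha, rank):
    """Fold a LoRA update into a base weight: W + (alpha / rank) * (B @ A)."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)  # shape: out_features x in_features
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 base weight with a rank-1 adapter (hypothetical values).
base = [[1.0, 0.0], [0.0, 1.0]]
lora_a = [[1.0, 2.0]]      # rank x in_features
lora_b = [[0.5], [-0.5]]   # out_features x rank
merged = merge_lora(base, lora_a, lora_b, alpha=2, rank=1)
print(merged)  # [[2.0, 2.0], [-1.0, -1.0]]
```

After merging, the adapter is no longer needed at inference time, which is why the result can be quantized and distributed as a single model.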

GPTQ quantization command:

CUDA_VISIBLE_DEVICES=0 python llama.py c4 --wbits 4 --true-sequential --act-order --groupsize 128
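To illustrate what --wbits 4 and --groupsize 128 control, here is a simplified sketch of group-wise quantization. It uses plain round-to-nearest with a min/max range per group, which is a stand-in and not the GPTQ algorithm itself. GPTQ additionally compensates for rounding error, and --act-order and --true-sequential are not modeled here. All function names and the sample data are hypothetical.

```python
def quantize_group(values, wbits=4):
    """Map one group to integer codes in [0, 2**wbits - 1] with a shared scale and offset."""
    levels = 2 ** wbits - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / levels or 1.0  # fall back to 1.0 for constant groups
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo


def dequantize_group(codes, scale, lo):
    """Reconstruct approximate float values from codes."""
    return [c * scale + lo for c in codes]


def quantize_row(row, groupsize=128, wbits=4):
    """Quantize a weight row in consecutive groups of `groupsize` values."""
    return [
        quantize_group(row[start:start + groupsize], wbits)
        for start in range(0, len(row), groupsize)
    ]


# Hypothetical weight row of 256 values, so it splits into two groups of 128.
row = [((i * 37) % 101 - 50) / 50.0 for i in range(256)]
groups = quantize_row(row, groupsize=128, wbits=4)
restored = [v for g in groups for v in dequantize_group(*g)]
max_error = max(abs(a - b) for a, b in zip(row, restored))
print(len(groups), max_error)
```

Each group stores its own scale and offset, so a smaller group size tracks the weights more closely at the cost of extra metadata.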

In ooba (oobabooga's text-generation-webui), make sure to launch with --wbits 4 --groupsize 128 so the loader settings match the quantization.
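The two loader flags mirror the "4bit" and "128g" parts of the model name. As a small sketch, assuming that naming convention holds, the flags can be derived from the name. The helper flags_from_name is hypothetical and is not part of any tool.

```python
import re


def flags_from_name(model_name):
    """Build --wbits/--groupsize arguments from a name like '...-4bit-128g'."""
    bits = re.search(r"(\d+)bit", model_name)
    group = re.search(r"(\d+)g\b", model_name)
    if bits is None or group is None:
        raise ValueError(f"cannot infer quantization settings from {model_name!r}")
    return ["--wbits", bits.group(1), "--groupsize", group.group(1)]


flags = flags_from_name("llama-13b-supercot-4bit-128g")
print(" ".join(flags))  # --wbits 4 --groupsize 128
```

If the name does not follow this pattern, the helper raises an error and does not guess, since loading with mismatched settings would not decode the weights correctly.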