---
pipeline_tag: conversational
tags:
  - vicuna
  - llama
  - text-generation-inference
---

Converted for use with llama.cpp

- 4-bit quantized
- Needs ~6GB of CPU RAM
- Won't work with alpaca.cpp or older llama.cpp builds (the new ggml format requires the latest llama.cpp)
- 7B parameter version
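As a rough sketch, a quantized ggml model like this one can be run with llama.cpp's `main` binary. The model filename, paths, and prompt template below are assumptions; adjust them to your download and checkout:

```shell
# Sketch only: filenames and paths are assumptions, not the exact names shipped here.
# Requires a recent llama.cpp build (older builds can't read the new ggml format).
./main \
  -m ./models/ggml-vicuna-7b-4bit.bin \
  -n 256 \
  -p "### Human: What is llama.cpp?\n### Assistant:"
```

`-m` points at the quantized model file, `-n` caps the number of generated tokens, and `-p` supplies the prompt (Vicuna-style chats are typically framed with `### Human:` / `### Assistant:` turns).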

A bigger 13B version can be found here: https://huggingface.co/eachadea/ggml-vicuna-13b-4bit
