CRD716
/

ggml-LLaMa-65B-quantized

Text Generation

text-generation-inference

Model card Files Files and versions Community

ggml-LLaMa-65B-quantized / README.md

CRD716's picture

messing with stuff

f962dcd almost 2 years ago

|

395 Bytes

	---
	license: gpl-3.0
	metrics:
	- perplexity
	pipeline_tag: conversational
	tags:
	- LLaMa
	- text-generation-inference
	- ggml
	---

	LLaMa 65B converted to ggml via LLaMa.cpp, then quantized to 4bit.

	I recommend the following settings when running as a good starting point: main.exe -m ggml-LLaMa-65B-q4_0.bin -n -1 -t 42 -c 2048 --temp 0.35 --interactive-first --repeat_penalty 1.2 --instruct --color