dranger003's picture
Update README.md
344c908 verified
|
raw
history blame
No virus
382 Bytes
metadata
license: llama2
library_name: gguf
pipeline_tag: text-generation

GGUF importance matrix (imatrix) quants for https://huggingface.co/codellama/CodeLlama-70b-Instruct-hf
The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.

Layers Context Template
0
4096
TBD