Quantizations of https://huggingface.co/TheSkullery/llama-3-cat-8b-instruct-v1

From original readme

Cat-llama3-instruct is a llama 3 8b finetuned model focusing on system prompt fidelity, helpfulness and character engagement. The model aims to respect system prompt to an extreme degree, provide helpful information regardless of situations, and offer maximum character immersion (Role Play) in given scenes.

Downloads last month: 145

GGUF

Model size

8.03B params

Architecture

llama

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

View +3 files

Inference Examples

Text Generation

Inference API (serverless) has been turned off for this model.