Not-For-All-Audiences

Just a q8_0 and q4_0 version. If you need versions with other quantization parameters, please let me know

GGUF

Model size

34.4B params

Architecture

llama

4-bit

8-bit

Inference Examples

Inference API (serverless) does not yet support model repos that contain custom code.

TriadParty
/

deepsex-34b-gguf

Datasets used to train TriadParty/deepsex-34b-gguf