Just a q8_0 and q4_0 version. If you need versions with other quantization parameters, please let me know

Downloads last month
231
GGUF
Model size
34.4B params
Architecture
llama

4-bit

8-bit

Inference Examples
Inference API (serverless) does not yet support model repos that contain custom code.

Datasets used to train TriadParty/deepsex-34b-gguf