Monero's WizardLM-Uncensored-SuperCOT-Storytelling-30B GGML
These files are GGML format model files for Monero's WizardLM-Uncensored-SuperCOT-Storytelling-30B.
Works with latest llama.cpp version. (05/06/23 build = 622)
Prompt template
Optional instruction ("You are a helpful assistant" etc)
USER: prompt
ASSISTANT:
Usage RAM:
llama_model_load_internal: mem required = 19756.67 MB (+ 3124.00 MB per state)
Inference API (serverless) has been turned off for this model.