RachidAR/WizardLM-Uncensored-SCOT-StoryTelling-30B-Q2_K-GGML

Monero's WizardLM-Uncensored-SuperCOT-Storytelling-30B GGML

Optional instruction ("You are a helpful assistant", etc.)
USER: prompt
ASSISTANT:
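
As a rough illustration of the prompt format above, here is a minimal Python sketch that assembles the text sent to the model. The function name `build_prompt` and its parameters are hypothetical and not part of this repository. It assumes the three parts are separated by newlines, as the template is laid out here.

```python
def build_prompt(user_message: str, instruction: str = "") -> str:
    """Assemble a prompt in the USER/ASSISTANT layout shown on this card.

    `instruction` is the optional system-style line; it is omitted when empty.
    Assumption: parts are joined with single newlines.
    """
    lines = []
    if instruction:
        lines.append(instruction)
    lines.append(f"USER: {user_message}")
    # The model is expected to continue the text after this marker.
    lines.append("ASSISTANT:")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_prompt("Tell me a short story.", "You are a helpful assistant"))
```

The resulting string can be passed as the prompt to whichever GGML-compatible runner you use.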

This 2-bit (Q2_K) model can run with 16 GB of RAM. On my old Xeon E3-1225 v3 (4/8) CPU, it runs at ~660 ms per token.
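
To put the per-token latency in perspective, the small sketch below converts it to throughput and estimates how long a reply of a given length would take. The helper names and the 200-token reply length are illustrative assumptions; only the ~660 ms figure comes from this card, and it ignores prompt-processing time.

```python
def tokens_per_second(ms_per_token: float) -> float:
    """Convert a per-token latency in milliseconds to tokens per second."""
    return 1000.0 / ms_per_token


def generation_seconds(n_tokens: int, ms_per_token: float) -> float:
    """Estimate generation time in seconds for n_tokens (prompt processing excluded)."""
    return n_tokens * ms_per_token / 1000.0


if __name__ == "__main__":
    latency_ms = 660  # figure reported above for the Xeon E3-1225 v3
    print(f"{tokens_per_second(latency_ms):.2f} tokens/s")
    # 200 tokens is a hypothetical reply length chosen for illustration.
    print(f"{generation_seconds(200, latency_ms):.0f} s for 200 tokens")
```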
