---
inference: false
license: other
---
# Monero's WizardLM-Uncensored-SuperCOT-Storytelling-30B GGML
These files are GGML format model files for [Monero's WizardLM-Uncensored-SuperCOT-Storytelling-30B](https://huggingface.co/Monero/WizardLM-Uncensored-SuperCOT-StoryTelling-30b).
# Works with the latest llama.cpp version (05/06/23, build 622)
## Prompt template
```
Optional instruction ("You are a helpful assistant" etc)
USER: prompt
ASSISTANT:
```
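
As a minimal sketch of how the template above could be filled in programmatically, the Python helper below assembles the optional instruction, the `USER:` line, and the trailing `ASSISTANT:` line. The function name `build_prompt` and its parameters are hypothetical and not part of llama.cpp or this repository; the resulting string would be passed to whatever frontend you use.

```python
def build_prompt(user_message: str, instruction: str = "") -> str:
    """Assemble a single-turn prompt in the USER/ASSISTANT format.

    `instruction` is the optional first line of the template
    (for example "You are a helpful assistant"); it is skipped when empty.
    """
    lines = []
    if instruction:
        lines.append(instruction)
    lines.append(f"USER: {user_message}")
    # The model continues the text after this line.
    lines.append("ASSISTANT:")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_prompt("Write a short story about a lighthouse.",
                       "You are a helpful assistant"))
```
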
*This 2-bit model can run with 16 GB of RAM.*
*On my old Xeon E3-1225 v3 (4/8) CPU, it runs at ~660 ms per token.*
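
To put the per-token latency above in more familiar terms, the short Python sketch below converts it to tokens per second and estimates how long a reply of a given length would take. The 200-token reply length is an arbitrary assumption for illustration, and the helper names are hypothetical; real timings will vary with hardware, thread count, and prompt length.

```python
MS_PER_TOKEN = 660  # approximate figure quoted above for this CPU


def tokens_per_second(ms_per_token: float) -> float:
    """Convert a per-token latency in milliseconds to throughput."""
    return 1000.0 / ms_per_token


def generation_seconds(n_tokens: int, ms_per_token: float) -> float:
    """Estimate wall-clock time to generate n_tokens (prompt processing not included)."""
    return n_tokens * ms_per_token / 1000.0


if __name__ == "__main__":
    print(f"{tokens_per_second(MS_PER_TOKEN):.2f} tokens/s")
    # 200 tokens is an assumed reply length, not a measured value.
    print(f"{generation_seconds(200, MS_PER_TOKEN):.0f} s for 200 tokens")
```

At this rate the model produces roughly 1.5 tokens per second, so a 200-token reply would take a little over two minutes.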