aifeifei798's picture
Upload README.md
05709f6 verified
|
raw
history blame
5.2 kB
metadata
license: llama3
language:
  - en
tags:
  - roleplay
  - llama3
  - sillytavern
  - idol

image/png

Special Thanks:

These are my own quantizations (updated almost daily).

The difference with normal quantizations is that I quantize the output and embed tensors to f16. and the other tensors to 15_k,q6_k or q8_0. This creates models that are little or not degraded at all and have a smaller size. They run at about 3-6 t/sec on CPU only using llama.cpp And obviously faster on computers with potent GPUs

Model Description:

The module combination has been readjusted to better fulfill various roles and has been adapted for mobile phones.

  • Saving money(LLama 3.1)
  • only test en.
  • Input Models input text only. Output Models generate text and code only.
  • Uncensored
  • Quick response
  • A scholarly response akin to a thesis.(I tend to write songs extensively, to the point where one song almost becomes as detailed as a thesis. :)
  • DarkIdol:Roles that you can imagine and those that you cannot imagine.
  • Roleplay
  • Specialized in various role-playing scenarios

How To

Llama 3.1 is a new model and may still experience issues such as refusals (which I have not encountered in my tests). Please understand. If you have any questions, feel free to leave a comment, and I will respond as soon as I see it.

virtual idol Twitter

Questions

  • The model's response results are for reference only, please do not fully trust them.
  • This model is solely for learning and testing purposes, and errors in output are inevitable. We do not take responsibility for the output results. If the output content is to be used, it must be modified; if not modified, we will assume it has been altered.
  • For commercial licensing, please refer to the Llama 3.1 agreement.

Stop Strings

    stop = [
      "## Instruction:",
      "### Instruction:",
      "<|end_of_text|>",
      "  //:",
      "</s>",
      "<3```",
      "### Note:",
      "### Input:",
      "### Response:",
      "### Emoticons:"
    ],

More Model Use

character

If you want to use vision functionality:

  • You must use the latest versions of Koboldcpp.

To use the multimodal capabilities of this model and use vision you need to load the specified mmproj file, this can be found inside this model repo. Llava MMProj

  • You can load the mmproj by using the corresponding section in the interface: image/png