FPHam committed on
Commit be23ac4
1 Parent(s): f035c56

Upload 10 files

Karen-The-Editor-GPTQ-4bit-128g.no-act.order.safetensors ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:4268c96414b21c159a97bce0759e3870a5fd128a5069ac539a04b163a0579b1a
size 7454797160
README.md ADDED
@@ -0,0 +1,134 @@
---
inference: false
license: other
---

<!-- header start -->
<div style="width: 100%;">
<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
</div>
<div style="display: flex; justify-content: space-between; width: 100%;">
<div style="display: flex; flex-direction: column; align-items: flex-start;">
<p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
</div>
<div style="display: flex; flex-direction: column; align-items: flex-end;">
<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
</div>
</div>
<!-- header end -->

# FPHam's Karen The Editor 13B GPTQ

These files are GPTQ 4-bit model files for [FPHam's Karen The Editor 13B](https://huggingface.co/FPHam/Karen_theEditor_13b_HF).

They are the result of quantising the model to 4-bit using [GPTQ-for-LLaMa](https://github.com/qwopqwop200/GPTQ-for-LLaMa).

## Other repositories available

* [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Karen_theEditor_13B-GPTQ)
* [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/Karen_theEditor_13B-GGML)
* [Unquantised fp16 model in PyTorch format, for GPU inference and for further conversions](https://huggingface.co/FPHam/Karen_theEditor_13b_HF)

## Prompt template

```
USER: Edit the following for spelling and grammar mistakes:
ASSISTANT:
```

## How to easily download and use this model in text-generation-webui

### Downloading the model

1. Click the **Model tab**.
2. Under **Download custom model or LoRA**, enter `TheBloke/Karen_theEditor_13B-GPTQ`.
3. Click **Download**.
4. Wait until it says it's finished downloading.
5. Untick **Autoload model**.
6. Click the **Refresh** icon next to **Model** in the top left.
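
If you prefer to script the download instead, the same repo can be fetched with the `huggingface_hub` library. This is a minimal sketch, not part of the original instructions; the local directory name is just an example:

```python
from huggingface_hub import snapshot_download

# Fetch every file in the repo (including the ~7.5 GB safetensors)
# into ./Karen_theEditor_13B-GPTQ. The directory name is illustrative.
snapshot_download(
    repo_id="TheBloke/Karen_theEditor_13B-GPTQ",
    local_dir="Karen_theEditor_13B-GPTQ",
)
```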

### To use with AutoGPTQ (if installed)

1. In the **Model drop-down**: choose the model you just downloaded, `Karen_theEditor_13B-GPTQ`.
2. Under **GPTQ**, tick **AutoGPTQ**.
3. Click **Save settings for this model** in the top right.
4. Click **Reload the Model** in the top right.
5. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!
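
Outside the web UI, the same file can also be loaded directly from Python with AutoGPTQ. A minimal sketch, assuming the `auto-gptq` package is installed and a CUDA GPU is available; the sample text and generation settings are illustrative:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

repo = "TheBloke/Karen_theEditor_13B-GPTQ"
tokenizer = AutoTokenizer.from_pretrained(repo, use_fast=False)

# model_basename is the safetensors filename without its extension.
model = AutoGPTQForCausalLM.from_quantized(
    repo,
    model_basename="Karen-The-Editor-GPTQ-4bit-128g.no-act.order",
    use_safetensors=True,
    device="cuda:0",
)

# Build the prompt from the template shown above.
text = "She don't like when peoples is late."
prompt = f"USER: Edit the following for spelling and grammar mistakes: {text}\nASSISTANT:"

inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```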

### To use with GPTQ-for-LLaMa

1. In the **Model drop-down**: choose the model you just downloaded, `Karen_theEditor_13B-GPTQ`.
2. If you see an error in the bottom right, ignore it - it's temporary.
3. Fill out the `GPTQ parameters` on the right: `Bits = 4`, `Groupsize = 128`, `model_type = Llama`.
4. Click **Save settings for this model** in the top right.
5. Click **Reload the Model** in the top right.
6. Once it says it's loaded, click the **Text Generation tab** and enter a prompt!

## Provided files

**Karen-The-Editor-GPTQ-4bit-128g.no-act.order.safetensors**

This will work with all versions of GPTQ-for-LLaMa, and with AutoGPTQ.

It was created with the following parameters:

* `Karen-The-Editor-GPTQ-4bit-128g.no-act.order.safetensors`
  * Works with all versions of GPTQ-for-LLaMa code, both Triton and CUDA branches
  * Works with AutoGPTQ
  * Works with text-generation-webui one-click-installers
  * Parameters: Groupsize = 128. Act Order / desc_act = False.

<!-- footer start -->
## Discord

For further support, and discussions on these models and AI in general, join us at:

[TheBloke AI's Discord server](https://discord.gg/UBgz4VXf)

## Thanks, and how to contribute

Thanks to the [chirper.ai](https://chirper.ai) team!

I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine-tuning/training.

If you're able and willing to contribute, it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

Donors will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

* Patreon: https://patreon.com/TheBlokeAI
* Ko-Fi: https://ko-fi.com/TheBlokeAI

**Patreon special mentions**: Aemon Algiz; Dmitiry Samsonov; Jonathan Leane; Illia Dulskyi; Khalefa Al-Ahmad; Nikolai Manek; senxiiz; Talal Aujan; vamX; Eugene Pentland; Lone Striker; Luke Pendergrass; Johann-Peter Hartmann.

Thank you to all my generous patrons and donors.
<!-- footer end -->

# Original model card: FPHam's Karen The Editor 13B

## Karen is an editor for your fiction.

She fixes grammar and wording issues, but doesn't necessarily start rewording everything into corporate speak the way ChatGPT does, so she should keep the style intact.

Based on LLaMA 13B and a Wizard-Vicuna-uncensored finetune, then finetuned with about 20k grammar examples (bad grammar/good grammar).

Karen gets triggered by this prompt (pun intended):

```
USER: Edit the following for spelling and grammar mistakes:
ASSISTANT:
```

Feed her a paragraph at a time - a few sentences - as that's where she works best.

Here is my instruct yaml for the oobabooga webui (in Instruct mode):

```
user: "USER: Edit the following for spelling and grammar mistakes: "
bot: "ASSISTANT:"
turn_template: "<|user|> <|user-message|>\n<|bot|> <|bot-message|></s>\n"
context: "Assistant is a meticulous senior editor with a specialization in editing fictional stories. When given text, Assistant detects and corrects grammatical errors, including subject-verb agreement, tense consistency, punctuation, capitalization, use of correct articles and correct present perfect and past perfect tense.\n"
```

I am not sure to what extent the context part has any effect. It doesn't hurt, though.
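
For readers unfamiliar with the webui's instruct mode, here is a minimal sketch of roughly how a turn template like the one above expands into the final prompt; the sample sentence is illustrative, and the model's reply fills the `<|bot-message|>` slot:

```python
context = ("Assistant is a meticulous senior editor with a specialization "
           "in editing fictional stories. ...\n")  # abridged here for brevity
user = "USER: Edit the following for spelling and grammar mistakes: "
bot = "ASSISTANT:"
message = "He walk to the store yesterday."

turn_template = "<|user|> <|user-message|>\n<|bot|> <|bot-message|></s>\n"

# Keep everything before <|bot-message|>: that is the text the model sees,
# and generation stops once it emits </s>.
prefix = turn_template.split("<|bot-message|>")[0]
prompt = context + (prefix
                    .replace("<|user|>", user)
                    .replace("<|user-message|>", message)
                    .replace("<|bot|>", bot))
print(prompt)
```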

This is version 0.2, finetuned just so that it retains the underlying Vicuna qualities.

config.json ADDED
@@ -0,0 +1,24 @@
{
  "_name_or_path": "H:\\oobabooga_windows\\text-generation-webui\\models\\TheBloke_Wizard-Vicuna-13B-Uncensored-HF",
  "architectures": [
    "LlamaForCausalLM"
  ],
  "bos_token_id": 1,
  "eos_token_id": 2,
  "hidden_act": "silu",
  "hidden_size": 5120,
  "initializer_range": 0.02,
  "intermediate_size": 13824,
  "max_position_embeddings": 2048,
  "max_sequence_length": 2048,
  "model_type": "llama",
  "num_attention_heads": 40,
  "num_hidden_layers": 40,
  "pad_token_id": 0,
  "rms_norm_eps": 1e-06,
  "tie_word_embeddings": false,
  "torch_dtype": "float16",
  "transformers_version": "4.30.0.dev0",
  "use_cache": true,
  "vocab_size": 32000
}
generation_config.json ADDED
@@ -0,0 +1,7 @@
{
  "_from_model_config": true,
  "bos_token_id": 1,
  "eos_token_id": 2,
  "pad_token_id": 0,
  "transformers_version": "4.30.0.dev0"
}
huggingface-metadata.txt ADDED
@@ -0,0 +1,6 @@
url: https://huggingface.co/TheBloke/Karen_theEditor_13B-GPTQ
branch: main
download date: 2023-06-03 03:34:33
sha256sum:
    4268c96414b21c159a97bce0759e3870a5fd128a5069ac539a04b163a0579b1a Karen-The-Editor-GPTQ-4bit-128g.no-act.order.safetensors
    9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347 tokenizer.model
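
The checksums above can be verified locally after downloading. A minimal sketch using only the Python standard library; the file paths assume the current directory:

```python
import hashlib

def sha256sum(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so the ~7.5 GB safetensors never sits in RAM."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

expected = {
    "Karen-The-Editor-GPTQ-4bit-128g.no-act.order.safetensors":
        "4268c96414b21c159a97bce0759e3870a5fd128a5069ac539a04b163a0579b1a",
    "tokenizer.model":
        "9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347",
}
for name, digest in expected.items():
    assert sha256sum(name) == digest, f"checksum mismatch: {name}"
```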
quantize_config.json ADDED
@@ -0,0 +1,8 @@
{
  "bits": 4,
  "group_size": 128,
  "damp_percent": 0.01,
  "desc_act": false,
  "sym": true,
  "true_sequential": true
}
special_tokens_map.json ADDED
@@ -0,0 +1,24 @@
{
  "bos_token": {
    "content": "<s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "eos_token": {
    "content": "</s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "pad_token": "<unk>",
  "unk_token": {
    "content": "<unk>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
tokenizer.json ADDED
The diff for this file is too large to render.
 
tokenizer.model ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:9e556afd44213b6bd1be2b850ebbbd98f5481437a8021afaf58ee7fb1818d347
size 499723
tokenizer_config.json ADDED
@@ -0,0 +1,34 @@
{
  "add_bos_token": true,
  "add_eos_token": false,
  "bos_token": {
    "__type": "AddedToken",
    "content": "<s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "clean_up_tokenization_spaces": false,
  "eos_token": {
    "__type": "AddedToken",
    "content": "</s>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  },
  "model_max_length": 2048,
  "pad_token": null,
  "padding_side": "right",
  "sp_model_kwargs": {},
  "tokenizer_class": "LlamaTokenizer",
  "unk_token": {
    "__type": "AddedToken",
    "content": "<unk>",
    "lstrip": false,
    "normalized": true,
    "rstrip": false,
    "single_word": false
  }
}
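
As a quick sanity check of the settings above (`add_bos_token: true`, `add_eos_token: false`), the tokenizer should prepend `<s>` to encoded text but not append `</s>`. A minimal sketch with transformers; loading the slow tokenizer assumes the `sentencepiece` package is installed:

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("TheBloke/Karen_theEditor_13B-GPTQ", use_fast=False)
ids = tok("Hello").input_ids
print(ids[0] == tok.bos_token_id)   # True: add_bos_token prepends <s> (id 1)
print(ids[-1] == tok.eos_token_id)  # False: add_eos_token is false, no </s>
```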