Upload README.md

README.md CHANGED
@@ -26,6 +26,7 @@ prompt_template: '<#meta#>
 '
 quantized_by: TheBloke
 ---
+<!-- markdownlint-disable MD041 -->
 
 <!-- header start -->
 <!-- 200823 -->
@@ -185,7 +186,7 @@ Note that using Git with HF repos is strongly discouraged. It will be much slowe
 
 <!-- README_GPTQ.md-download-from-branches end -->
 <!-- README_GPTQ.md-text-generation-webui start -->
-## How to easily download and use this model in [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
+## How to easily download and use this model in [text-generation-webui](https://github.com/oobabooga/text-generation-webui)
 
 Please make sure you're using the latest version of [text-generation-webui](https://github.com/oobabooga/text-generation-webui).
 
@@ -193,16 +194,20 @@ It is strongly recommended to use the text-generation-webui one-click-installers
 
 1. Click the **Model tab**.
 2. Under **Download custom model or LoRA**, enter `TheBloke/Inkbot-13B-8k-0.2-GPTQ`.
-
-
+
+- To download from a specific branch, enter for example `TheBloke/Inkbot-13B-8k-0.2-GPTQ:gptq-4bit-32g-actorder_True`
+- see Provided Files above for the list of branches for each option.
+
 3. Click **Download**.
 4. The model will start downloading. Once it's finished it will say "Done".
 5. In the top left, click the refresh icon next to **Model**.
 6. In the **Model** dropdown, choose the model you just downloaded: `Inkbot-13B-8k-0.2-GPTQ`
 7. The model will automatically load, and is now ready for use!
 8. If you want any custom settings, set them and then click **Save settings for this model** followed by **Reload the Model** in the top right.
-
-
+
+- Note that you do not need to and should not set manual GPTQ parameters any more. These are set automatically from the file `quantize_config.json`.
+
+9. Once you're ready, click the **Text Generation** tab and enter a prompt to get started!
 
 <!-- README_GPTQ.md-text-generation-webui end -->
 
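The branch bullets added in this hunk can also be scripted outside the UI. A minimal sketch using `huggingface_hub`'s `snapshot_download`, assuming `huggingface_hub` is installed; the `local_dir` path is an illustrative choice, not taken from the README:

```python
# Sketch: download the gptq-4bit-32g-actorder_True branch without text-generation-webui.
# local_dir is an arbitrary example path, not from the README.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="TheBloke/Inkbot-13B-8k-0.2-GPTQ",
    revision="gptq-4bit-32g-actorder_True",  # one of the branches listed under Provided Files
    local_dir="Inkbot-13B-8k-0.2-GPTQ",
)
```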
@@ -214,7 +219,7 @@ It's recommended to use TGI version 1.1.0 or later. The official Docker containe
 Example Docker parameters:
 
 ```shell
---model-id TheBloke/Inkbot-13B-8k-0.2-GPTQ --port 3000 --quantize
+--model-id TheBloke/Inkbot-13B-8k-0.2-GPTQ --port 3000 --quantize gptq --max-input-length 3696 --max-total-tokens 4096 --max-batch-prefill-tokens 4096
 ```
 
 Example Python code for interfacing with TGI (requires huggingface-hub 0.17.0 or later):
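The README's "Example Python code for interfacing with TGI" follows in the full file; as a minimal sketch of what such a client can look like, assuming a TGI server started with the Docker parameters above (so listening on port 3000) and illustrative generation settings:

```python
# Sketch: query a local TGI server started with the parameters shown above.
# Assumes huggingface-hub >= 0.17.0 and TGI listening on http://127.0.0.1:3000.
from huggingface_hub import InferenceClient

client = InferenceClient(model="http://127.0.0.1:3000")

output = client.text_generation(
    "Tell me about AI",   # example prompt
    max_new_tokens=128,   # illustrative values, not from the README
    temperature=0.7,
)
print(output)
```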