Epiculous
/

Azure_Dusk-v0.1-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Epiculous commited on Aug 24

Commit

448e772

•

1 Parent(s): cb23f8e

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -39,6 +39,9 @@ If you are using GGUF I strongly advise using ChatML, for some reason that quant
 [Crimson_Dawn-Nitral-Special](https://files.catbox.moe/8xjxht.json) - Considered the best settings! <br/>
 [Crimson_Dawn-Magnum-Style](https://files.catbox.moe/lc59dn.json)
 ## Training
 Training was done twice over 2 epochs each on two 2x [NVIDIA A6000 GPUs](https://www.nvidia.com/en-us/design-visualization/rtx-a6000/) using LoRA. A two-phased approach was used in which the base model was trained 2 epochs on Instruct data, the LoRA was then applied to base. Finally, the new modified base was trained 2 epochs on RP, and the new RP LoRA was applied to the modified base, resulting in what you see here.

 [Crimson_Dawn-Nitral-Special](https://files.catbox.moe/8xjxht.json) - Considered the best settings! <br/>
 [Crimson_Dawn-Magnum-Style](https://files.catbox.moe/lc59dn.json)
+### Tokenizer
+If you are using SillyTavern, please set the tokenizer to API (WebUI/ koboldcpp)
 ## Training
 Training was done twice over 2 epochs each on two 2x [NVIDIA A6000 GPUs](https://www.nvidia.com/en-us/design-visualization/rtx-a6000/) using LoRA. A two-phased approach was used in which the base model was trained 2 epochs on Instruct data, the LoRA was then applied to base. Finally, the new modified base was trained 2 epochs on RP, and the new RP LoRA was applied to the modified base, resulting in what you see here.