Alsebay / NarumashiRTS-7B-V2-1

	---
	language:
	- en
	license: cc-by-nc-4.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- mistral
	- trl
	- sft
	- Roleplay
	- roleplay
	base_model: SanjiWatsuki/Kunoichi-DPO-v2-7B
	---
	> [!Important]
	> Still experimental.
	# About this model

	This is a remake of [version 2](https://huggingface.co/Alsebay/NarumashiRTS-V2), saved in safetensors format with a safer and more stable method. Not much else has changed (based on the model hash). To be honest, in the previous version 2 I used an unsafe method to save the pretrained model, which could apply the LoRA layers to the model twice and give the model terrible performance. (Thanks to the Unsloth community for telling me about this :D )
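To make the double-apply problem concrete, here is a minimal sketch of the usual LoRA merge arithmetic, where the merged weight is the base weight plus the low-rank product of the adapter matrices. It is only an illustration under assumptions: the tiny matrices, the `merge_lora` helper, and the `scale` parameter are hypothetical and are not the code that was used to save this model.

```python
# Illustration only: toy matrices as nested lists, not real model weights.
# All names here (matmul, add, merge_lora, W, B, A) are hypothetical.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def add(a, b):
    """Add two matrices of the same shape element by element."""
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def merge_lora(weight, lora_b, lora_a, scale=1.0):
    """Return weight + scale * (lora_b @ lora_a), the standard LoRA merge."""
    delta = matmul(lora_b, lora_a)
    return add(weight, [[scale * d for d in row] for row in delta])


W = [[1.0, 0.0], [0.0, 1.0]]  # toy base weight (2x2)
B = [[0.5], [0.0]]            # toy LoRA "B" matrix (2x1)
A = [[0.0, 1.0]]              # toy LoRA "A" matrix (1x2)

merged_once = merge_lora(W, B, A)             # intended result
merged_twice = merge_lora(merged_once, B, A)  # adapter applied a second time

print(merged_once)
print(merged_twice)
```

In this toy case the second merge doubles the adapter's contribution, so the saved weights no longer match what was trained. That is the kind of drift that can hurt output quality.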

	- Finetuned on a roughly translated dataset to improve accuracy on the TSF theme, which is not very popular. (lewd dataset)
	- Finetuned from model: SanjiWatsuki/Kunoichi-DPO-v2-7B. Many thanks to SanjiWatsuki :)
	## GGUF version? [Here](https://huggingface.co/mradermacher/NarumashiRTS-7B-V2-1-GGUF). Thank you, mradermacher!
	## V2 has more epochs.
	## Dataset
	```
	Dataset (all novels):
	30% skinsuit
	30% possession
	35% transform (shapeshift)
	5% other
	```

	# Thanks to Unsloth for a good finetuning tool. This Mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)