Dampish
/

Retry-dante-2.8B-V2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Retry-dante-2.8B-V2 / README.md

Dampish's picture

Update README.md

e45bf89 over 1 year ago

|

1.38 kB

	---
	license: cc-by-nc-sa-4.0
	---
	![Concept Art](cc.png)
	Model trained to be as helpful of an assistant as possible.

	### Data split percentage
	60% coding
	10% conversations
	20% instructions
	10% Roleplay

	Now obviously the data contains other elements but this is the biggest part

	### Examples
	```
	The prompt structure can be almost anything, this model has been trained on 0.6 million instruction, which is less than dante.
	This model has however been trained on a much cleaner and organized dataset and has been retrained multiple times to reach as low
	of a training loss as possible.

	## Examples.

	"You are an AI assistant respond to human in a helpful manner.
	HM: What were the causes for world war 2?
	"

	"Act like a detective from the 1900s, respond to mike in a helpful manner.
	HM: What were the causes for world war 2?
	"

	The prompt also works with alpaca structure. I have purposefully trained it so that it should work like this.
	EOS token is <\|end\|>.

	Remember to tell it how it should act for best effect.
	```

	### More information
	The base model is GPT-NeoX taken pretrained by redpajama.

	We managed to reach 0.45 Validation loss with a 0.3 learning loss.

	You are not allowed to use this for commerical purposes unless reaching an agreement with the creator @Dampish, @Dampish#3607 on discord.





	The model can easily be further fine tuned to most languages.