Text Generation
Transformers
Safetensors
English
mistral
Inference Endpoints
text-generation-inference

v1olet/v1olet_marcoroni-go-bruins-merge-7B, trained for one epoch on my NSFW_DPO-v1 dataset. The same LoRA state was then trained on the DPO-v2 dataset until the run crashed (that dataset is private until I can figure it out), and from that point it was trained for one more epoch on NSFW_DPO-v1.

Model size: 7.24B params
Tensor type: BF16 (Safetensors)

Dataset used to train athirdpath/NSFW_DPO_vmgb-7b
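
For anyone who wants to try the model, here is a minimal loading sketch. It assumes the repo id `athirdpath/NSFW_DPO_vmgb-7b` named on this card, the BF16 tensor type listed here, and the standard Transformers `AutoModelForCausalLM` / `AutoTokenizer` classes. The helper names `load_model` and `generate` are hypothetical, and the card does not state a prompt format, so the prompt is passed through as plain text.

```python
MODEL_ID = "athirdpath/NSFW_DPO_vmgb-7b"  # repo id as written on this card


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and the model weights in bfloat16.

    Imports are deferred so this file can be imported without
    torch/transformers installed; they are required to actually load.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches the BF16 tensor type on the card
    )
    return tokenizer, model


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for a plain-text prompt (no chat template assumed)."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Once upon a time")` downloads the weights on first use, so expect it to need roughly 15 GB of disk and memory for a 7.24B-parameter BF16 checkpoint.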