Text Generation
PEFT
lora
dpo
nomen-ai

Nomen-AI DPO LoRA Adapter

Target output repo for the DPO anti-derivative phase of Nomen-AI.

Training command:

export SFT_ADAPTER=krystv/nomen-ai-sft-lora
export HUB_MODEL_ID=krystv/nomen-ai-dpo-lora
python scripts/train_dpo.py

Main pipeline repo: https://huggingface.co/krystv/nomen-ai

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for krystv/nomen-ai-dpo-lora

Adapter
(1014)
this model

Dataset used to train krystv/nomen-ai-dpo-lora

Space using krystv/nomen-ai-dpo-lora 1