jyc0325/Mistral-7B-v0.3-sft-ultrachat-hhrlhf-dpo

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Mistral-7B-v0.3-sft-ultrachat-hhrlhf-dpo/last-checkpoint/latest

Training in progress, epoch 3, checkpoint

5e254f4 verified 5 days ago

14 Bytes

global_step939