YYYYYYibo
/

full_vanilla_dpo_iter_1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

full_vanilla_dpo_iter_1 / tokenizer.json

YYYYYYibo's picture

Training in progress, step 50

7e8f6a0 verified 4 months ago

history contribute delete

1.8 MB

File too large to display, you can check the raw version instead.