XueyingJia
/

pythia-1b-online-dpo-SG-merge-llama-judge-test-resume

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

pythia-1b-online-dpo-SG-merge-llama-judge-test-resume / tokenizer.json

XueyingJia's picture

Training in progress, step 1500

f3a9388 verified 18 days ago

history contribute delete

3.56 MB

File too large to display, you can check the raw version instead.