Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

trl-lib
/
pythia-1b-deduped-tldr-online-dpo

TensorBoard
Safetensors
gpt_neox
Generated from Trainer
Model card Files Files and versions Metrics Training metrics Community
pythia-1b-deduped-tldr-online-dpo / runs /Jul09_19-14-53_ip-26-0-160-225
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
edbeeching's picture
edbeeching HF Staff
Add vwxyzjn/online_dpo_tldr-main checkpoint
83e2e55 verified 10 months ago
  • events.out.tfevents.1720552568.ip-26-0-160-225.441432.0
    75.8 kB
    LFS
    Add vwxyzjn/online_dpo_tldr-main checkpoint 10 months ago