lomahony/pythia-2.8b-helpful-dpo
Text Generation
Transformers
PyTorch
Anthropic/hh-rlhf
English
gpt_neox
causal-lm
pythia
text-generation-inference
Inference Endpoints
arxiv: 2101.00027
License: apache-2.0
Commit History
Update README.md (54274f7, verified), lomahony committed on May 14
Update README.md (7029094, verified), lomahony committed on Feb 15
Update README.md (0312b1e, verified), lomahony committed on Jan 16
Update README.md (454bbdb, verified), lomahony committed on Jan 16
Create README.md (5693378, verified), lomahony committed on Jan 16
Upload pytorch_model.bin (f375b28, verified), lomahony committed on Jan 12
Upload config.json (2b25b46, verified), lomahony committed on Jan 12
Upload special_tokens_map.json (a4728b1, verified), lomahony committed on Jan 12
Upload policy.pt (a9a5020, verified), lomahony committed on Jan 12
Upload generation_config.json (da467f4, verified), lomahony committed on Jan 12
Upload tokenizer_config.json (bb65c1f, verified), lomahony committed on Jan 12
Upload tokenizer.json (f897593, verified), lomahony committed on Jan 12
Upload pytorch_model.bin (c43a84d, verified), lomahony committed on Jan 12
Upload config.json (7914f62, verified), lomahony committed on Jan 12
Upload special_tokens_map.json (32768e5, verified), lomahony committed on Jan 12
Upload policy.pt (37b1c58, verified), lomahony committed on Jan 12
Upload generation_config.json (038e4c2, verified), lomahony committed on Jan 12
Upload tokenizer_config.json (32494bb, verified), lomahony committed on Jan 12
Upload tokenizer.json (b5f97a5, verified), lomahony committed on Jan 12
initial commit (2161d94, verified), lomahony committed on Jan 12