Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sileod
/
deberta-v3-large-tasksource-rlhf-reward-model
like
11
Text Classification
Transformers
PyTorch
Anthropic/hh-rlhf
English
deberta-v2
rlhf
Eval Results
Inference Endpoints
arxiv:
2204.05862
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
main
deberta-v3-large-tasksource-rlhf-reward-model
/
README.md
Commit History
Update README.md
2787455
sileod
commited on
Mar 28, 2023
Update README.md
683961f
sileod
commited on
Mar 28, 2023
Update README.md
e226218
sileod
commited on
Mar 28, 2023
Update README.md
d60ef35
sileod
commited on
Mar 28, 2023
Create README.md
052ab03
sileod
commited on
Mar 28, 2023