Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
sileod
/
deberta-v3-large-tasksource-rlhf-reward-model
like
11
Text Classification
Transformers
PyTorch
Anthropic/hh-rlhf
English
deberta-v2
rlhf
Eval Results
Inference Endpoints
arxiv:
2204.05862
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
052ab03
deberta-v3-large-tasksource-rlhf-reward-model
Commit History
Create README.md
052ab03
sileod
commited on
Mar 28, 2023
Upload tokenizer
bc9d816
sileod
commited on
Mar 28, 2023
Upload DebertaV2ForMultipleChoice
99b1e35
sileod
commited on
Mar 28, 2023
initial commit
d44f6e4
sileod
commited on
Mar 28, 2023