Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
OpenAssistant
/
reward-model-deberta-v3-large
like
20
Text Classification
Transformers
PyTorch
openai/summarize_from_feedback
openai/webgpt_comparisons
Dahoas/instruct-synthetic-prompt-responses
English
deberta-v2
reward-model
reward_model
RLHF
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
d19d4ef
reward-model-deberta-v3-large
2 contributors
History:
4 commits
theblackcat102
Update README.md
d19d4ef
over 1 year ago
.gitattributes
1.48 kB
initial commit
over 1 year ago
README.md
2.47 kB
Update README.md
over 1 year ago
added_tokens.json
23 Bytes
Upload 13 files
over 1 year ago
config.json
991 Bytes
Upload 13 files
over 1 year ago
optimizer.pt
3.48 GB
LFS
Upload 13 files
over 1 year ago
pytorch_model.bin
1.74 GB
LFS
Upload 13 files
over 1 year ago
rng_state.pth
14.6 kB
LFS
Upload 13 files
over 1 year ago
scaler.pt
559 Bytes
LFS
Upload 13 files
over 1 year ago
scheduler.pt
559 Bytes
LFS
Upload 13 files
over 1 year ago
special_tokens_map.json
173 Bytes
Upload 13 files
over 1 year ago
spm.model
2.46 MB
LFS
Upload 13 files
over 1 year ago
tokenizer.json
8.66 MB
Upload 13 files
over 1 year ago
tokenizer_config.json
455 Bytes
Upload 13 files
over 1 year ago
trainer_state.json
39 kB
Upload 13 files
over 1 year ago
training_args.bin
3.44 kB
LFS
Upload 13 files
over 1 year ago