Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Ray2333
/
gpt2-large-harmless-reward_model
like
1
Text Classification
Transformers
Safetensors
Anthropic/hh-rlhf
gpt2
Inference Endpoints
text-generation-inference
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
Ray2333
commited on
Jan 15
Commit
d59cbec
•
1 Parent(s):
d2d8717
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+5
-1
README.md
CHANGED
Viewed
@@ -1,3 +1,7 @@
1
---
2
license: mit
3
-
---
1
---
2
license: mit
3
+
datasets:
4
+
- Anthropic/hh-rlhf
5
+
metrics:
6
+
- accuracy
7
+
---