rinna/japanese-gpt-neox-3.6b-instruction-ppo
Text Generation
Transformers
PyTorch
Safetensors
Anthropic/hh-rlhf
Japanese
gpt_neox
lm
nlp
text-generation-inference
arxiv:2203.02155
arxiv:1707.06347
arxiv:2404.01657
License: mit
d179df6
japanese-gpt-neox-3.6b-instruction-ppo/model.safetensors
Commit History
Adding `safetensors` variant of this model
d179df6
SFconvertbot committed on Aug 24, 2023