Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
berkeley-nest
/
Starling-LM-7B-alpha
like
553
Follow
Berkeley-Nest
61
Text Generation
Transformers
Safetensors
berkeley-nest/Nectar
English
mistral
reward model
RLHF
RLAIF
conversational
text-generation-inference
Inference Endpoints
arxiv:
2306.02231
License:
apache-2.0
Model card
Files
Files and versions
Community
32
Train
Deploy
Use this model
a074a80
Starling-LM-7B-alpha
/
model-00001-of-00003.safetensors
Commit History
Duplicate from banghua/openchat-3.5-ppo-ckpt-6k
fa6178c
banghua
commited on
Nov 25, 2023