Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
berkeley-nest
/
Starling-LM-7B-alpha
like
553
Follow
Berkeley-Nest
61
Text Generation
Transformers
Safetensors
berkeley-nest/Nectar
English
mistral
reward model
RLHF
RLAIF
conversational
text-generation-inference
Inference Endpoints
arxiv:
2306.02231
License:
apache-2.0
Model card
Files
Files and versions
Community
32
Train
Deploy
Use this model
9dc75f5
Starling-LM-7B-alpha
/
tokenizer_config.json
Commit History
Add OpenChat3.5 chat template (
#6
)
16019ae
banghua
khu
commited on
Nov 28, 2023
Duplicate from banghua/openchat-3.5-ppo-ckpt-6k
fa6178c
banghua
commited on
Nov 25, 2023