Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
CallComply
/
Starling-LM-11B-alpha
like
12
Follow
Call Comply
10
Text Generation
Transformers
Safetensors
berkeley-nest/Nectar
English
mistral
reward model
RLHF
RLAIF
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv:
2306.02231
License:
cc-by-nc-4.0
Model card
Files
Files and versions
Community
5
Train
Deploy
Use this model
perlthoughts
commited on
Dec 5, 2023
Commit
fa1f6db
•
1 Parent(s):
f50e50a
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-1
README.md
CHANGED
Viewed
@@ -11,7 +11,7 @@ tags:
11
- RLAIF
12
---
13
14
-
# Starling-
RM
-11B-alpha
15
16
Merge configuration with mergekit:
17
11
- RLAIF
12
---
13
14
+
# Starling-
LM
-11B-alpha
15
16
Merge configuration with mergekit:
17