Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ContextualAI
/
archangel_dpo_llama13b
like
0
Follow
ContextualAI
53
Text Generation
Transformers
Safetensors
stanfordnlp/SHP
Anthropic/hh-rlhf
OpenAssistant/oasst1
English
llama
human feedback
rlhf
preferences
alignment
HALO
halos
dpo
rl
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
3a20471
archangel_dpo_llama13b
Commit History
Upload tokenizer
3a20471
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
a6d015d
xwinxu
commited on
Jan 8
Upload README.md with huggingface_hub
56b9991
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
5cb318f
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
7177a60
xwinxu
commited on
Dec 7, 2023
Upload README.md with huggingface_hub
0632eec
xwinxu
commited on
Dec 6, 2023
Upload README.md with huggingface_hub
2d64b44
xwinxu
commited on
Dec 6, 2023
Upload LlamaForCausalLM
c71fd3a
stas
commited on
Nov 25, 2023
Upload tokenizer
abfa16a
stas
commited on
Nov 25, 2023
initial commit
9443bc7
stas
commited on
Nov 25, 2023