Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Salesforce
/
LLaMA-3-8B-SFR-Iterative-DPO-R
like
73
Text Generation
Transformers
Safetensors
llama
conversational
text-generation-inference
Inference Endpoints
arxiv:
2405.07863
arxiv:
2312.11456
License:
llama3
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
10e1a7b
LLaMA-3-8B-SFR-Iterative-DPO-R
Commit History
Update README.md
10e1a7b
verified
bpucla
commited on
May 31
Update README.md
5358809
verified
bpucla
commited on
May 31
Update README.md
6ecb6b8
verified
hendrydong
commited on
May 14
Update README.md
4871bd2
verified
hendrydong
commited on
May 14
Upload tokenizer
cdaa737
verified
hendrydong
commited on
May 14
Upload LlamaForCausalLM
8273159
verified
hendrydong
commited on
May 14
Upload model-00001-of-00004.safetensors
b4f6270
verified
bpucla
commited on
May 13
Upload 6 files
a528b4b
verified
bpucla
commited on
May 12
Update README.md
b715281
verified
bpucla
commited on
May 10
Update README.md
fc5d28a
verified
bpucla
commited on
May 10
Update README.md
15a287e
verified
bpucla
commited on
May 10
Update README.md
32d34de
verified
bpucla
commited on
May 10
initial commit
6307193
verified
bpucla
commited on
May 9