UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter1
UCLA Artificial General Intelligence Lab
Text Generation
Transformers
Safetensors
openbmb/UltraFeedback
English
llama
conversational
text-generation-inference
Inference Endpoints
arxiv: 2405.00675
License: apache-2.0
Commit History
Update README.md (2076437, verified), angelahzyuan committed on Jun 25
Update README.md (0c324fb, verified), angelahzyuan committed on Jun 25
Upload tokenizer (163b412, verified), angelahzyuan committed on Jun 25
Upload LlamaForCausalLM (b7ba939, verified), angelahzyuan committed on Jun 25
initial commit (b7d22e6, verified), angelahzyuan committed on Jun 25