Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
dfurman
/
Llama-3-8B-Orpo-v0.1
like
1
Text Generation
Transformers
Safetensors
mlabonne/orpo-dpo-mix-40k
English
llama
orpo
llama 3
rlhf
sft
conversational
Eval Results
text-generation-inference
Inference Endpoints
License:
llama3
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
9e764e3
Llama-3-8B-Orpo-v0.1
Commit History
Update README.md
9e764e3
verified
dfurman
commited on
Apr 26
Create README.md
46590e2
verified
dfurman
commited on
Apr 26
Upload tokenizer
977334f
verified
dfurman
commited on
Apr 26
Upload LlamaForCausalLM
30a2f10
verified
dfurman
commited on
Apr 26
initial commit
dbe98c4
verified
dfurman
commited on
Apr 26