Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
dfurman
/
Llama-3-8B-Orpo-v0.1
like
1
Text Generation
Transformers
Safetensors
mlabonne/orpo-dpo-mix-40k
English
llama
orpo
llama 3
rlhf
sft
conversational
Eval Results
text-generation-inference
Inference Endpoints
License:
llama3
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
f4b059b
Llama-3-8B-Orpo-v0.1
/
README.md
Commit History
Adding Evaluation Results
f4b059b
verified
leaderboard-pr-bot
commited on
Sep 17
Update README.md
f02aef8
verified
dfurman
commited on
Apr 29
Update README.md
7d94c1e
verified
dfurman
commited on
Apr 29
Update README.md
3d99bce
verified
dfurman
commited on
Apr 29
Update README.md
9203a3b
verified
dfurman
commited on
Apr 29
Update README.md
62dbdda
verified
dfurman
commited on
Apr 29
Update README.md
3764f1d
verified
dfurman
commited on
Apr 28
Update README.md
e34c520
verified
dfurman
commited on
Apr 28
Upload LlamaForCausalLM
3f07b4b
verified
dfurman
commited on
Apr 28
Update README.md
79a5c2b
verified
dfurman
commited on
Apr 28
Update README.md
a9a7ab8
verified
dfurman
commited on
Apr 26
Update README.md
6a5c74a
verified
dfurman
commited on
Apr 26
Update README.md
9e764e3
verified
dfurman
commited on
Apr 26
Create README.md
46590e2
verified
dfurman
commited on
Apr 26
Upload LlamaForCausalLM
30a2f10
verified
dfurman
commited on
Apr 26