thewordsmiths/Meta-Llama-3-8B_sft_LoRA_100000_dpo_merged
The Wordsmiths
Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
unsloth
trl
dpo
Inference Endpoints
License: apache-2.0
main
Commit History
Upload model trained with Unsloth
0fc68a3 (verified), paultltc committed on Jun 3, 2024
Upload model trained with Unsloth
4254ee9 (verified), paultltc committed on Jun 3, 2024
Upload README.md with huggingface_hub
a6fed0a (verified), paultltc committed on Jun 3, 2024
initial commit
9b959a6 (verified), paultltc committed on Jun 3, 2024