Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ENERGY-DRINK-LOVE
/
SOLAR_merge_DPOv3
like
0
Text Generation
Transformers
Safetensors
llama
trl
dpo
generated_from_trainer
conversational
Inference Endpoints
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
Edit model card
Model
Base Moel
Framework versions
Model
trained on custom DPO dataset
dedup
~20000??
Base Moel
ENERGY-DRINK-LOVE/SOLAR_merge
Framework versions
Transformers 4.38.1
Pytorch 2.2.1+cu118
Datasets 2.17.1
Tokenizers 0.15.2
Downloads last month
1,269
Safetensors
Model size
10.7B params
Tensor type
BF16
·
Finetuned from
ENERGY-DRINK-LOVE/SOLAR_merge
Evaluation results
Metadata error: specify a dataset to view leaderboard