hansh/hansken_human_hql_v3
PEFT, TensorBoard, Safetensors, hansh/hansken_hql_cot, llama, alignment-handbook, trl, sft, Generated from Trainer, 4-bit precision, bitsandbytes
License: llama3.1
main: hansken_human_hql_v3/runs/Sep19_14-09-12_938RL43
1 contributor
History: 11 commits
hansh: End of training (7c3e09e, verified), 29 days ago
events.out.tfevents.1726747893.938RL43.482299.0 | Safe | 208 kB | LFS | Training in progress, epoch 10 | 29 days ago
events.out.tfevents.1726861191.938RL43.482299.1 | Safe | 359 Bytes | LFS | End of training | 29 days ago