Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

tanliboy
/
lambda-llama-3-8b-ipo-test

Text Generation
Transformers
TensorBoard
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Model card Files Files and versions Metrics Training metrics Community
lambda-llama-3-8b-ipo-test / runs
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
tanliboy's picture
tanliboy
End of training
070dabb verified 8 months ago
  • Sep21_18-21-43_action-graph-trainer
    Model save 8 months ago
  • Sep21_19-14-54_action-graph-trainer
    End of training 8 months ago