Text Generation
Transformers
Safetensors
stablelm
alignment-handbook
trl
orpo
Generated from Trainer
conversational
stablelm-2-1_6b-orpo-full-v3 / tokenizer_config.json

Commit History

Training in progress, step 100
703dc7c
verified

vain05 commited on