lole25/phi-2-gpo-ultrachat-lora-0.1

alignment-handbook

Generated from Trainer


phi-2-gpo-ultrachat-lora-0.1 / runs / Feb29_16-55-01_gpu4-119-4

1 contributor

History: 1 commit


Model save

e412c7e verified 9 months ago

events.out.tfevents.1709186197.gpu4-119-4.1081689.0 (11.4 kB, LFS): Model save, 9 months ago

events.out.tfevents.1709187625.gpu4-119-4.1081689.1 (815 Bytes, LFS): Model save, 9 months ago