cleanrl/EleutherAI_pythia-1b-deduped__sft__tldr
Text Generation
•
Updated
•
6.25k
The checkpoints are trained in https://arxiv.org/abs/2403.17031 and taken from https://wandb.ai/costa-huang/tldr_summarize/reports/Release--Vmlldzo3MT