Text Generation
scaling
sp-baseline-research-1b-bf16 / model_state_layer_18_TransformerLMHead.pt

Commit History