MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 • Text Generation • Updated Jan 3 • 7
MNC-LLM/batch1_epochs1_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated Dec 12, 2023 • 7
MNC-LLM/Mistral-7B-NWS-u2k-Marcoroni-prompt-found-LaAdMoAl-ep4lr5 • Text Generation • Updated Dec 12, 2023 • 7
MNC-LLM/Mistral-7B-NWS-u2k-merge-Marcoroni-LaAdMoAl-ep4-lr5 • Text Generation • Updated Dec 11, 2023 • 9
MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated Dec 11, 2023 • 9
MNC-LLM/batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 • Updated Dec 11, 2023