SebastianSchramm/Cerebras-GPT-111M-instruction-GPTQ-4bit-128g-actorder_True Text Generation • Updated Aug 26, 2023 • 1.36k • 1
smcwp/Cerebras-GPT-256M_DP_w_peft_adapterCasualConversation_1000_neft_alpha_50000_max_grad_3000 Updated Mar 8 • 12
SebastianSchramm/Cerebras-GPT-111M-instruction-sft-lora-merged-dpo-lora Text Generation • Updated Nov 18, 2023 • 3
SebastianSchramm/Cerebras-GPT-111M-instruction-sft-lora-merged Text Generation • Updated Nov 18, 2023 • 2
claysauruswrecks/cerebras-gpt-111m-pretrain-stack-smol-0-15k-chkp Text Generation • Updated Apr 25, 2023 • 1
claysauruswrecks/cerebras-gpt-111m-pretrain-stack-smol-1-30k-2e Text Generation • Updated May 9, 2023 • 1
lm-human-preference-details/train_policy_accelerate_tf_adam_cerebras_gpt_111M__descriptiveness_offline_5k.json__seed5 Text Generation • Updated Oct 5, 2023 • 1