lm-human-preference-details/train_policy_accelerate_tf_adam_cerebras_gpt_111M__descriptiveness_offline_5k.json__seed5 Text Generation • Updated Oct 5, 2023 • 2
lm-human-preference-details/train_policy_accelerate_tf_adam_cerebras_gpt_111M__descriptiveness_offline_5k.json__seed1 Text Generation • Updated Oct 5, 2023 • 2
lm-human-preference-details/train_policy_accelerate_tf_adam_cerebras_gpt_111M__descriptiveness_offline_5k.json__seed3 Text Generation • Updated Oct 5, 2023 • 2
SebastianSchramm/Cerebras-GPT-111M-instruction-sft-lora-merged-dpo-lora-merged Text Generation • Updated Nov 18, 2023 • 2
smcwp/Cerebras-GPT-256M_DP_w_peft_adapterCasualConversation_1000_neft_alpha_50000_max_grad_3000 Updated Mar 8 • 2
lm-human-preference-details/train_policy_accelerate_tf_adam_cerebras_gpt_111M__descriptiveness_offline_5k.json__seed2 Text Generation • Updated Oct 5, 2023 • 1