TingchenFu/RM_gpt2-large_HH_bf16_helpful0.01_bs32lr1.41e-5decay0.0cosine_07051702 Text Classification • Updated 5 days ago • 1
TingchenFu/RM_gpt2-large_HH_bf16_helpful0.02_bs32lr1.41e-5decay0.0cosine_07051338 Text Classification • Updated 5 days ago • 1
tinnguyen/gpt2_toxicity_reduction_finetuned_model__inference_refusal_prompt_engineer__30_epochs Updated 5 days ago
tinnguyen/gpt2_toxicity_reduction_finetuned_model__train_and_inference_refusal_prompt_engineer__30_epochs Updated 5 days ago
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.01_bs32lr1.41e-5decay0.0cosine_07070257 Text Classification • Updated 5 days ago • 1
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.02_bs32lr1.41e-5decay0.0cosine_07070257 Text Classification • Updated 5 days ago • 1
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.05_bs32lr1.41e-5decay0.0cosine_07070300 Text Classification • Updated 5 days ago • 1
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.1_bs32lr1.41e-5decay0.0cosine_07070300 Text Classification • Updated 5 days ago • 1