distillslm/alpaca_seq_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it Text Generation • Updated 1 day ago • 1
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-16k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 5
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_VD-QWQ-Noisy-Small-8k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 4
distillslm/alpaca_seq_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-32B-Instruct Text Generation • Updated 2 days ago • 1
secmlr/dpo_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_full Text Generation • Updated 2 days ago • 127
secmlr/VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 48
secmlr/VD-QWQ-Clean-8k_VD-QWQ-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 48
distillslm/alpaca_seq_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • Updated 2 days ago • 4
secmlr/VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 131
secmlr/VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5_train_dpo Viewer • Updated 2 days ago • 5.45k • 176
secmlr/VD-DS-Clean-8k_VD-DS-Clean-16k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 2 days ago • 131
secmlr/Sky-T1-Filtered_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 3 days ago • 1
secmlr/Sky-T1-Filtered_VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5 Text Generation • Updated 3 days ago • 1