HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-5-gamma-share-expert Text Generation • 16B • Updated 8 days ago • 20 • 1
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5-share-expert Text Generation • 14B • Updated 8 days ago • 24
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-1e-5 Text Generation • 14B • Updated 8 days ago • 20
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-5-gamma Text Generation • 16B • Updated 8 days ago • 8
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5-share-expert Text Generation • 14B • Updated 8 days ago • 9
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-3e-5 Text Generation • 14B • Updated 8 days ago • 5
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-3e-5 Text Generation • 7B • Updated 9 days ago • 9
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-1e-5 Text Generation • 7B • Updated 9 days ago • 6
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma-share-expert Text Generation • 16B • Updated 9 days ago • 24 • 1
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-share-expert Text Generation • 7B • Updated 9 days ago • 20 • 1
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma Text Generation • 16B • Updated 9 days ago • 22
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-share-experts Text Generation • 14B • Updated 9 days ago • 15 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-3-gamma-part2 Text Generation • 14B • Updated 12 days ago • 22
HectorHe/DeepSeek-V2-Lite-aux-free-sft-math7k-1epoch-1e-4-gamma-share-experts-2nd-epoch-high-bias-expert Text Generation • 16B • Updated 12 days ago • 9