north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_29999 3B • Updated Aug 13, 2025 • 1
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_5000 3B • Updated Aug 13, 2025 • 1
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_10000 3B • Updated Aug 14, 2025 • 2
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_15000 3B • Updated Aug 14, 2025 • 2
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_20000 3B • Updated Aug 16, 2025 • 2
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_25000 3B • Updated Aug 16, 2025 • 2
north/north_llama32_3b_enhancedNCC_instruct_v1_e3_all_but_knowledge_mc_qa_lr2e-6_2048_v2_30000 3B • Updated Aug 16, 2025 • 2
Nishef/SmolLM2-360M-Full_KNOWLEDGE_RETAINING_ENHANCED_KTO_20251227_151509 Text Generation • Updated Dec 27, 2025 • 4
Nishef/SmolLM2-360M-Full_KNOWLEDGE_RETAINING_ENHANCED_KTO_20251227_151509-merged Text Generation • 0.4B • Updated Dec 27, 2025 • 2