YunxinLi/pretrain_BART_generator_coldstart_OFA
Updated
Coldstart/Llama-3.1-8B-Instruct-Surfer-Dude-Personality
Updated
Coldstart/Llama-3.1-8B-Instruct-Hillbilly-Personality
Updated
joey00072/Llama-3.2-1B-Instruct-cold-start-ft
Text Generation
• 1B • Updated • 10
CodeDPO/qwen2.5-coder-inst-cold-start-R1
8B • Updated • 2
joey00072/Llama-3.2-1B-Instruct-cold-start-ft2
Text Generation
• 1B • Updated • 68
CodeDPO/qwen25-coder-inst-7b-reinforce-plus_v2_mini_processed_r1_cold_start
8B • Updated • 4
tensorblock/Llama-3.2-1B-Instruct-cold-start-ft2-GGUF
winglian/reasoning-llama-3.1-70b-stratos-cold-start
Text Generation
• 71B • Updated • 5
winglian/reasoning-llama-3.1-8b-stratos-cold-start
Text Generation
• 1B • Updated • 3
winglian/reasoning-llama-3.1-8b-stratos-cold-start-v2
Text Generation
• 8B • Updated • 5 • 1
winglian/reasoning-llama-3.1-70b-stratos-cold-start-v2
Text Generation
• 71B • Updated • 1
mradermacher/reasoning-llama-3.1-70b-stratos-cold-start-v2-GGUF
71B • Updated • 45
ak36/qwen1.5b-coldstart-rl
2B • Updated • 3
ak36/qwen0.5b-coldstart-rl
0.5B • Updated • 2
Text Generation
• 3B • Updated • 4
datasciencesage/coldstartmodel
Updated
Zhang199/TinyLLaVA-Video-Coldstart_NextQA_16
Video-Text-to-Text
• 4B • Updated • 9 • 1
SunshineAndRain/Clinical-R1-3B-Cold-Start
Text Generation
• 3B • Updated • 2
mradermacher/reasoning-llama-3.1-70b-stratos-cold-start-v2-i1-GGUF
71B • Updated • 40
Spencerbot15/Qwen-1.5B-MTG-Drafting-Coldstart
Updated
Elfsong/Qwen2.5-Coder-3B-Instruct-Venus-Cold-Start
Text Generation
• 3B • Updated • 4
wzq016/qwen2.5_32B_LR8.0e-7_filtered_sky_code_8k_math_10k_cold_start_same_setting_4k8k_0501
33B • Updated • 3
tinycompany/Qwentify-Cold-start-adibun-CoT
Text Generation
• 2B • Updated • 2
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_shuffle_0510
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_learn08_bleu02_0510
Updated
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn16_learn10_0510
swordfaith/ReTool-Qwen3-4B-SFT-cold-started
Text Generation
• 4B • Updated • 5 • 1
mm-o1/qwen2_5_vl_7b_mmr1_coldstartv10_rlv9_wllm_response4k_rolloutn32_shuffle_0510