zapqqqwe/save_mcts_evol_v1_1k
zd21/ReST-MCTS-Llama3-8b-Instruct-Policy-1st
Text Generation • 8B • Updated • 9 • 5
RobbiePasquale/gpt-moe-mcts
micaebe/Qwen2.5-0.5B-MCTS-Value-Net
Text Classification • 0.5B • Updated • 9
B41425/alphazero-gobang9x9-mcts400-iteration-0118
Updated
cjfcsjt/142_webshopv_sft300_3hist_dpo_mcts_demons_acprob_maxmin
Updated
cjfcsjt/train_mcts_webshopv_sft300_rft
Updated
cjfcsjt/train_mcts_webshopv-sft300-3hist_demons_acprob_niter3_v2
Updated
oceanpty/TOA-ultrafeedback-lla3-8b-inst-dpo-data-small-scale-mcts-n-40-pi-0-ni-30
8B • Updated • 2
MCTS-Refine/MCTS-Refine-7B
8B • Updated • 2 • 1
gsarch/ViGoRL-MCTS-SFT-3b-Web-Grounding
Image-Text-to-Text • 4B • Updated • 4
gsarch/ViGoRL-MCTS-SFT-7b-Web-Grounding
Image-Text-to-Text • 8B • Updated • 5
gsarch/ViGoRL-Multiturn-MCTS-SFT-3b-Web-Grounding
Image-Text-to-Text • 4B • Updated • 454 • 1
gsarch/ViGoRL-Multiturn-MCTS-SFT-3b-Visual-Search
4B • Updated • 521
gsarch/ViGoRL-Multiturn-MCTS-SFT-7b-Visual-Search
Image-Text-to-Text • 8B • Updated • 3
gsarch/ViGoRL-MCTS-SFT-3b-Spatial
Image-Text-to-Text • 4B • Updated • 108 • 1
gsarch/ViGoRL-MCTS-SFT-7b-Spatial
Image-Text-to-Text • 8B • Updated • 4
mradermacher/ViGoRL-MCTS-SFT-3b-Spatial-GGUF
3B • Updated • 20
mradermacher/ViGoRL-MCTS-SFT-3b-Spatial-i1-GGUF
3B • Updated • 73
mradermacher/ViGoRL-MCTS-SFT-7b-Spatial-GGUF
8B • Updated • 37
mradermacher/ViGoRL-MCTS-SFT-7b-Spatial-i1-GGUF
8B • Updated • 69
mradermacher/MCTS-Refine-7B-GGUF
8B • Updated • 22
Hiyo1256/chess-mcts-models
Updated