lmms-lab/LLaVA-Critic-R1-7B-Plus-Mimo
8B • Updated • 8 • 1
lmms-lab/LLaVA-Critic-R1-7B-LLaMA32v
11B • Updated • 3
mradermacher/LLaVA-Critic-R1-7B-Plus-Mimo-GGUF
8B • Updated • 108
critical12/tourism-purchase-predictor-rf
Tabular Classification
• Updated
mcemri/qwen2.5-3b-rl-cut-agent-ppo-taxonomy-step40-v0-critic
mcemri/qwen2.5-3b-rl-cut-agent-ppo-taxonomy-step200-v0-critic
mcemri/qwen2.5-3b-rl-cut-agent-ppo-taxonomy-step100-v0-critic
mcemri/qwen2.5-3b-rl-cut-agent-ppo-pure-step100-v0-critic
Updated
mcemri/qwen2.5-3b-rl-cut-agent-ppo-oure-step100-v0-critic
mcemri/qwen2.5-3b-rl-cut-agent-ppo-pure-step40-v0-critic
Updated
mcemri/qwen2.5-3b-rl-cut-agent-ppo-oure-step40-v0-critic
anirudhb11/critic_800_Qwen-1.5B-Instruct-ppo-run-math-training-prompt-len-800-response-len-4096-0c4e7a4810
Text Classification
• 2B • Updated • 2
anirudhb11/critic_600_Qwen-1.5B-Instruct-ppo-run-math-training-prompt-len-800-response-len-4096-c996b35f5b
Text Classification
• 2B • Updated • 3
anirudhb11/critic_400_Qwen-1.5B-Instruct-ppo-run-math-training-prompt-len-800-response-len-4096-1fa2c957e8
Text Classification
• 2B • Updated • 3
anirudhb11/critic_200_Qwen-1.5B-Instruct-ppo-run-math-training-prompt-len-800-response-len-4096-41d35e300a
Text Classification
• 2B • Updated • 3
anirudhb11/critic_16_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-5000-0f12cee30e
2B • Updated • 2
anirudhb11/critic_450_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-5000-cb2a139532
Text Classification
• 2B • Updated • 2
anirudhb11/critic_250_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-5000-f4747737d3
Text Classification
• 2B • Updated • 2
anirudhb11/critic_50_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-5000-9dccd3e81f
Text Classification
• 2B • Updated • 2
anirudhb11/critic_450_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-90c3ddec29
Text Classification
• 2B • Updated • 2
anirudhb11/critic_250_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-6394f91930
Text Classification
• 2B • Updated • 2
anirudhb11/critic_50_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-a-acd8a5ca05
Text Classification
• 2B • Updated • 2
anirudhb11/critic_450_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-733e92f07a
Text Classification
• 2B • Updated • 2
anirudhb11/critic_250_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-517c4b730f
2B • Updated • 2
anirudhb11/critic_50_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-500-o-133c7c3c62
2B • Updated • 2
anirudhb11/critic_16_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-128-b-16249eaad4
2B • Updated • 2
anirudhb11/critic_450_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-128-a20b868cff
Text Classification
• 2B • Updated • 2
anirudhb11/critic_250_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-128-4d82e69430
Text Classification
• 2B • Updated • 2
anirudhb11/critic_50_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-128-b-3c89517c58
Text Classification
• 2B • Updated • 2
anirudhb11/critic_800_ppo-run-math-training-prompt-len-800-response-len-4096-seed-43-subset-1000-3c88380bef
Text Classification
• 2B • Updated • 2