DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 13 days ago • 71
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • Updated 6 days ago • 24.4k • • 95
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 12 days ago • 52