ibm-granite/granite-vision-3.1-2b-preview Image-Text-to-Text • Updated about 8 hours ago • 5.17k • 61
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 20 days ago • 105
bartowski/uncensoredai_UncensoredLM-DeepSeek-R1-Distill-Qwen-14B-GGUF Text Generation • Updated 15 days ago • 52.5k • 18