-
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper • 2501.17161 • Published • 109 -
S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
Paper • 2502.12853 • Published • 29 -
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Paper • 2503.05592 • Published • 25 -
Self-Taught Self-Correction for Small Language Models
Paper • 2503.08681 • Published • 12
Shreyas S K
skshreyas714
·
AI & ML interests
NLP, NLU, NLI
Recent Activity
updated
a collection
8 days ago
Read-up research papers
published
a model
9 days ago
skshreyas714/prompt-guard-finetuned
updated
a collection
12 days ago
Read-up research papers
Organizations
None yet