33 81 93

Somshubra Majumdar

smajumdar94

AI & ML interests

None yet

Recent Activity

commented on a paper 2 days ago

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

updated a collection 2 days ago

OpenCodeReasoning

upvoted a paper 2 days ago

Inference-Time Scaling for Generalist Reward Modeling

View all activity

Organizations

smajumdar94's activity

commented a paper 2 days ago

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Paper • 2504.01943 • Published 4 days ago • 5 •

updated a collection 2 days ago

OpenCodeReasoning

Collection

Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 2 items • Updated 2 days ago • 1

upvoted a paper 2 days ago

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published 4 days ago • 28

updated a dataset 3 days ago

nvidia/OpenCodeReasoning

Viewer • Updated 2 days ago • 753k • 57 • 13

upvoted a paper 4 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 6 days ago • 58

upvoted a paper 5 days ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published 7 days ago • 17

liked a model 6 days ago

all-hands/openhands-lm-32b-v0.1

Text Generation • Updated 3 days ago • 3.62k • 282

upvoted 2 papers 11 days ago

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published 13 days ago • 15

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Paper • 2503.20201 • Published 12 days ago • 42

upvoted a paper 12 days ago

Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

Paper • 2503.19855 • Published 12 days ago • 25

upvoted a paper 14 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 17 days ago • 46

liked 2 models 15 days ago

nvidia/Llama-3.1-Nemotron-Nano-8B-v1

Text Generation • Updated 22 days ago • 11.3k • 101

nvidia/Llama-3_3-Nemotron-Super-49B-v1

Text Generation • Updated 17 days ago • 57.4k • 225

liked a Space 15 days ago

Canary 1B Flash

🐤

Canary 1B Flash demo

upvoted an article 18 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

20 days ago

• 33

liked a dataset 19 days ago

nvidia/Llama-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 19 days ago • 15.2M • 12.8k • 327

liked a Space 20 days ago

270

Thera Arbitrary-Scale Super-Resolution

🔥

Enhance image quality with real-time super-resolution

upvoted a paper 21 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 24 days ago • 27

liked a model 24 days ago

sesame/csm-1b

Text-to-Speech • Updated 21 days ago • 80.8k • • 1.8k

liked a model 26 days ago

nvidia/DeepSeek-R1-FP4

Text Generation • Updated 3 days ago • 50k • 232