Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment Paper • 2502.00203 • Published Jan 31
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published 19 days ago • 13
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated about 5 hours ago • 61
DevQuasar/nvidia.Llama-3_1-Nemotron-Ultra-253B-v1-GGUF Text Generation • Updated 7 days ago • 3.52k • 7
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated about 5 hours ago • 37
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper • 2405.01481 • Published May 2, 2024 • 31
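For the GGUF repository listed above (DevQuasar/nvidia.Llama-3_1-Nemotron-Ultra-253B-v1-GGUF), a minimal local-inference sketch with llama-cpp-python is shown below. The quantization filename pattern, context size, and prompt are assumptions, not taken from the listing; check the repository's file list for the actual shard names, and note that a 253B model needs substantial memory even when quantized.

# Minimal sketch, assuming llama-cpp-python is installed and the repo contains
# a Q4_K_M quantization. The filename glob below is hypothetical.
from llama_cpp import Llama

# Download a matching GGUF file from the Hub and load it.
llm = Llama.from_pretrained(
    repo_id="DevQuasar/nvidia.Llama-3_1-Nemotron-Ultra-253B-v1-GGUF",
    filename="*Q4_K_M*.gguf",  # hypothetical pattern; pick a real file from the repo
    n_ctx=4096,
)

# Simple chat-style generation.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Summarize the Nemotron-H architecture in two sentences."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])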