SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated about 9 hours ago • 15
NV-Embed Collection NV-Embed is a generalist embedding model that ranks No. 1 on MTEB benchmark encompassing retrieval, reranking, classification, clustering, STS tasks • 1 item • Updated 19 days ago • 4
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 15 days ago • 3
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 19 days ago • 32
InstructRetro Collection InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 19 days ago • 8
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 1 item • Updated 19 days ago • 16
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 6 items • Updated 19 days ago • 15
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated 15 days ago • 11
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 19 days ago • 41