Shubham Toshniwal's picture

Shubham Toshniwal

shtoshni

·

https://shtoshni.github.io/

shtoshni

AI & ML interests

NLP, Speech

Organizations

None yet

shtoshni's activity

upvoted a collection 6 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 70

upvoted a paper 6 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 161

upvoted a paper 7 months ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 46

upvoted a paper 8 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 129

upvoted 2 papers 9 months ago

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Paper • 2405.01481 • Published May 2, 2024 • 29

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

upvoted 8 papers 10 months ago

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1, 2024 • 30

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 25

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 32

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 26

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Paper • 2403.14624 • Published Mar 21, 2024 • 52

RewardBench: Evaluating Reward Models for Language Modeling

Paper • 2403.13787 • Published Mar 20, 2024 • 21

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Paper • 2403.09611 • Published Mar 14, 2024 • 126

Chronos: Learning the Language of Time Series

Paper • 2403.07815 • Published Mar 12, 2024 • 47

upvoted 2 papers 11 months ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8, 2024 • 62

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 146

upvoted a collection 11 months ago

OpenMath

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 4 days ago • 41

upvoted 2 papers 11 months ago

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15, 2024 • 37

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

upvoted a collection 11 months ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 83