SAMBIT CHAKRABORTY's picture

30 8

SAMBIT CHAKRABORTY

sambitchakhf03

·

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Accelerating Language Model Inference with Mixture of Attentions

upvoted a paper 5 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

upvoted a paper 5 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

View all activity

Organizations

sambitchakhf03's activity

upvoted an article 4 days ago

Article

Accelerating Language Model Inference with Mixture of Attentions

By

•

6 days ago

• 24

upvoted 2 papers 5 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 9 days ago • 72

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 7 days ago • 34

upvoted a paper 15 days ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published 20 days ago • 44

upvoted 2 papers about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 101

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 71

upvoted a paper 4 months ago

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 18

upvoted a collection 4 months ago

Seamless Communication

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 151

upvoted a paper 5 months ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 58

upvoted a collection 6 months ago

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated 2 days ago • 11

upvoted a paper 6 months ago

SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing

Paper • 2110.07205 • Published Oct 14, 2021 • 5

upvoted a collection 6 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 5 days ago • 546

upvoted 2 papers 7 months ago

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21, 2024 • 14

Is Programming by Example solved by LLMs?

Paper • 2406.08316 • Published Jun 12, 2024 • 12

upvoted an article 8 months ago

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages

May 24, 2024

• 25

upvoted an article 9 months ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Apr 22, 2024

• 80

upvoted 2 papers 9 months ago

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 254

upvoted a collection 9 months ago

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 702

upvoted a paper 9 months ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 65