SAMBIT CHAKRABORTY

sambitchakhf03

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Token-Efficient Long Video Understanding for Multimodal LLMs

upvoted a paper 12 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

upvoted a paper 16 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

View all activity

Organizations

sambitchakhf03's activity

upvoted a paper 1 day ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 9 days ago • 79

upvoted a paper 12 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 22 days ago • 162

upvoted 3 papers 16 days ago

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published 17 days ago • 53

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Paper • 2502.02481 • Published Feb 4 • 10

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 23 days ago • 66

upvoted a paper 28 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 30 days ago • 143

upvoted a paper 29 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 124

upvoted 4 papers about 1 month ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 57

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 24

Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning

Paper • 2502.03275 • Published Feb 5 • 15

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted 4 papers about 2 months ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 53

upvoted a paper 2 months ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 61

upvoted an article 2 months ago

Article

Accelerating Language Model Inference with Mixture of Attentions

and 1 other •

Jan 7

• 24

upvoted 2 papers 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published Jan 6 • 41

upvoted a paper 3 months ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46