Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • arXiv:2410.19008 • Published Oct 21, 2024
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • arXiv:2405.00732 • Published Apr 29, 2024
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA Article • Published May 24, 2023
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • arXiv:2401.02038 • Published Jan 4, 2024
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • arXiv:2404.02258 • Published Apr 2, 2024
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs Paper • arXiv:2403.20041 • Published Mar 29, 2024
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper • arXiv:2403.18795 • Published Mar 27, 2024
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • arXiv:2212.09720 • Published Dec 19, 2022
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • arXiv:2403.08763 • Published Mar 13, 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • arXiv:2403.09611 • Published Mar 14, 2024
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • arXiv:2401.16380 • Published Jan 29, 2024