3 30 14

quyettv

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

commented a paper about 1 month ago

upvoted a paper about 1 month ago

Organizations

None yet

quyettv's activity

upvoted a paper 21 days ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

upvoted a paper about 1 month ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

upvoted 2 papers about 2 months ago

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3 • 24

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 47

upvoted 3 papers 3 months ago

upvoted 4 papers 4 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 104

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 84

Longhorn: State Space Models are Amortized Online Learners

Paper • 2407.14207 • Published Jul 19 • 17

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1 • 85

upvoted a paper 5 months ago

AgentInstruct: Toward Generative Teaching with Agentic Flows

Paper • 2407.03502 • Published Jul 3 • 48

upvoted an article 5 months ago

Article

Welcome Gemma 2 - Google's new open LLM

Jun 27

• 123

upvoted 4 papers 5 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

Paper • 2406.15927 • Published Jun 22 • 13

LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21 • 61

An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13 • 50

upvoted an article 6 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13

• 369

upvoted 2 papers 6 months ago

ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 90

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Paper • 2405.15738 • Published May 24 • 43