Tripp Lyons's picture

Tripp Lyons

tripplyons

·

https://tripplyons.com

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

mit-han-lab/dc-ae-f32c32-sana-1.0-diffusers

liked a model 19 days ago

deepseek-ai/DeepSeek-V3

liked a model 20 days ago

answerdotai/ModernBERT-base

View all activity

Organizations

None yet

tripplyons's activity

upvoted an article 3 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14, 2024

• 61

upvoted a paper 10 months ago

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 18

upvoted a collection 12 months ago

SigLIP

Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated Dec 13, 2024 • 50

upvoted 4 papers about 1 year ago

Diffusion Model with Perceptual Loss

Paper • 2401.00110 • Published Dec 30, 2023 • 13

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 117

Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization

Paper • 2311.06243 • Published Nov 10, 2023 • 17

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

upvoted 8 papers over 1 year ago

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

CausalLM is not optimal for in-context learning

Paper • 2308.06912 • Published Aug 14, 2023 • 18

LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models

Paper • 2308.16137 • Published Aug 30, 2023 • 39

One Wide Feedforward is All You Need

Paper • 2309.01826 • Published Sep 4, 2023 • 31

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 37

Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

RMT: Retentive Networks Meet Vision Transformers

Paper • 2309.11523 • Published Sep 20, 2023 • 33