Direct Preference Heads (Preprint) Collection This collection contains the pre-trained, fine-tuned and aligned models for the Direct Preference Heads paper. • 3 items • Updated 1 day ago • 1
TransformerFAM: Feedback attention is working memory Paper • 2404.09173 • Published Apr 14, 2024 • 42
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 7
Learning and Leveraging World Models in Visual Representation Learning Paper • 2403.00504 • Published Mar 1, 2024 • 25
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry Paper • 2402.04347 • Published Feb 6, 2024 • 13
TransNormerLLM Collection TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer • 10 items • Updated Apr 11 • 3
Tiny Series Collection Tiny datasets that empower the foundation of small language models! • 11 items • Updated Jan 26 • 31
A Large-scale Dataset for Audio-Language Representation Learning Paper • 2309.11500 • Published Sep 20, 2023 • 9
AudioSR: Versatile Audio Super-resolution at Scale Paper • 2309.07314 • Published Sep 13, 2023 • 23
MSViT: Dynamic Mixed-Scale Tokenization for Vision Transformers Paper • 2307.02321 • Published Jul 5, 2023 • 7
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 74
The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit Paper • 2306.17759 • Published Jun 30, 2023 • 3
AniFaceDrawing: Anime Portrait Exploration during Your Sketching Paper • 2306.07476 • Published Jun 13, 2023 • 16
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation Paper • 2306.07954 • Published Jun 13, 2023 • 111
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning Paper • 2306.07967 • Published Jun 13, 2023 • 23
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping Paper • 2306.05544 • Published Jun 8, 2023 • 9
Face0: Instantaneously Conditioning a Text-to-Image Model on a Face Paper • 2306.06638 • Published Jun 11, 2023 • 16
Benchmarking Neural Network Training Algorithms Paper • 2306.07179 • Published Jun 12, 2023 • 22
FasterViT: Fast Vision Transformers with Hierarchical Attention Paper • 2306.06189 • Published Jun 9, 2023 • 29
Retrieval-Enhanced Contrastive Vision-Text Models Paper • 2306.07196 • Published Jun 12, 2023 • 7