-
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Paper • 2310.03502 • Published • 74 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 8 -
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
Paper • 2311.15127 • Published • 6 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 8
Collections
Discover the best community collections!
Collections including paper arxiv:2103.00020
-
The Unreasonable Effectiveness of Deep Features as a Perceptual Metric
Paper • 1801.03924 • Published • 2 -
Fine-Tuning Language Models from Human Preferences
Paper • 1909.08593 • Published • 2 -
Training Verifiers to Solve Math Word Problems
Paper • 2110.14168 • Published • 4 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 8
-
Demystifying CLIP Data
Paper • 2309.16671 • Published • 17 -
Model Stock: All we need is just a few fine-tuned models
Paper • 2403.19522 • Published • 9 -
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models
Paper • 2404.01367 • Published • 19 -
On the Scalability of Diffusion-based Text-to-Image Generation
Paper • 2404.02883 • Published • 17
-
FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Paper • 2403.06775 • Published • 3 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 5 -
Data Incubation -- Synthesizing Missing Data for Handwriting Recognition
Paper • 2110.07040 • Published • 2 -
A Mixture of Expert Approach for Low-Cost Customization of Deep Neural Networks
Paper • 1811.00056 • Published • 2
-
SMOTE: Synthetic Minority Over-sampling Technique
Paper • 1106.1813 • Published • 1 -
Scikit-learn: Machine Learning in Python
Paper • 1201.0490 • Published • 1 -
Identity Mappings in Deep Residual Networks
Paper • 1603.05027 • Published • 2 -
Deep Residual Learning for Image Recognition
Paper • 1512.03385 • Published • 5
-
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Paper • 2309.16414 • Published • 19 -
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Paper • 2309.13018 • Published • 9 -
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 11 -
Language models in molecular discovery
Paper • 2309.16235 • Published • 10
-
U-Net: Convolutional Networks for Biomedical Image Segmentation
Paper • 1505.04597 • Published • 5 -
Denoising Diffusion Probabilistic Models
Paper • 2006.11239 • Published • 3 -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 8 -
Denoising Diffusion Implicit Models
Paper • 2010.02502 • Published • 3