-
AutoCLIP: Auto-tuning Zero-Shot Classifiers for Vision-Language Models
Paper • 2309.16414 • Published • 19 -
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model
Paper • 2309.13018 • Published • 9 -
Robust Speech Recognition via Large-Scale Weak Supervision
Paper • 2212.04356 • Published • 11 -
Language models in molecular discovery
Paper • 2309.16235 • Published • 10
Collections
Discover the best community collections!
Collections including paper arxiv:2309.15091
-
Compositional Foundation Models for Hierarchical Planning
Paper • 2309.08587 • Published • 9 -
DreamLLM: Synergistic Multimodal Comprehension and Creation
Paper • 2309.11499 • Published • 57 -
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Paper • 2309.15091 • Published • 31 -
Context-Aware Meta-Learning
Paper • 2310.10971 • Published • 14
-
Sparse Autoencoders Find Highly Interpretable Features in Language Models
Paper • 2309.08600 • Published • 11 -
Language Modeling Is Compression
Paper • 2309.10668 • Published • 79 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 22 -
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Paper • 2309.15091 • Published • 31
-
OmnimatteRF: Robust Omnimatte with 3D Background Modeling
Paper • 2309.07749 • Published • 6 -
AudioSR: Versatile Audio Super-resolution at Scale
Paper • 2309.07314 • Published • 23 -
Generative Image Dynamics
Paper • 2309.07906 • Published • 51 -
MagiCapture: High-Resolution Multi-Concept Portrait Customization
Paper • 2309.06895 • Published • 27
-
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Paper • 2309.00398 • Published • 18 -
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Paper • 2307.04725 • Published • 63 -
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance
Paper • 2307.00522 • Published • 27 -
VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Paper • 2309.15091 • Published • 31