- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 53
- Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data
Paper • 2309.13876 • Published • 1
- Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Paper • 2310.06434 • Published • 4
Collections including paper arxiv:2311.00430
- Masked Autoencoders Are Scalable Vision Learners
Paper • 2111.06377 • Published • 2
- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 53
- distil-whisper/distil-large-v2
Automatic Speech Recognition • Updated • 28.2k • 490
- Seven Failure Points When Engineering a Retrieval Augmented Generation System
Paper • 2401.05856 • Published • 2

- Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Paper • 2312.03818 • Published • 31
- Scaling Laws of Synthetic Images for Model Training ... for Now
Paper • 2312.04567 • Published • 7
- Large Language Models for Mathematicians
Paper • 2312.04556 • Published • 11
- LooseControl: Lifting ControlNet for Generalized Depth Conditioning
Paper • 2312.03079 • Published • 12

- Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling
Paper • 2311.00430 • Published • 53
- SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Paper • 2307.01952 • Published • 74
- Language Modeling Is Compression
Paper • 2309.10668 • Published • 80
- Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models
Paper • 2311.00871 • Published • 2

- FinGPT: Large Generative Models for a Small Language
Paper • 2311.05640 • Published • 26
- LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
Paper • 2311.05556 • Published • 73
- Distributed Deep Learning in Open Collaborations
Paper • 2106.10207 • Published • 1
- Datasets: A Community Library for Natural Language Processing
Paper • 2109.02846 • Published • 7

- Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 39
- Data Filtering Networks
Paper • 2309.17425 • Published • 6
- FlashDecoding++: Faster Large Language Model Inference on GPUs
Paper • 2311.01282 • Published • 30
- E3 TTS: Easy End-to-End Diffusion-based Text to Speech
Paper • 2311.00945 • Published • 11