Running on Zero 1.89k 1.89k Chat With Janus-Pro-7B 🌍 A unified multimodal understanding and generation model.
Running 872 872 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
End-to-end speaker segmentation for overlap-aware resegmentation Paper • 2104.04045 • Published Apr 8, 2021 • 2
Training Datasets Collection A collection of pseudo-labelled datasets used to train the Distil-Whisper model. • 9 items • Updated Mar 21, 2024 • 14
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context Jul 23, 2024 • 230