view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 3 days ago β’ 27
LLaVA++ (LLaMA-3 and Phi-3-Mini) Collection Extending Visual Capabilities of LLaVA with LLaMA-3 and Phi-3 β’ 11 items β’ Updated 4 days ago β’ 21
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published 12 days ago β’ 226
Parler-TTS, fully open-source high-quality TTS models Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check-out the Parler-TTS repository on GitHub. β’ 5 items β’ Updated 24 days ago β’ 7
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper β’ 2403.13372 β’ Published Mar 20 β’ 50
Luganda Whisper ASR Collection Luganda Speech To Text/ Automatic Speech Recognition β’ 4 items β’ Updated Mar 17 β’ 1
Awesome Document AI Collection A collection of open-source document AI π π π β’ 27 items β’ Updated Mar 11 β’ 34
Transformers compatible Mamba Collection This release includes the `mamba` repositories compatible with the `transformers` library β’ 5 items β’ Updated Mar 6 β’ 25
MobiLlama Collection Collection of MobiLlama Language Models. β’ 6 items β’ Updated 8 days ago β’ 14
Quyen Collection State-of-the-arts General LLMs - based on Qwen1.5 β’ 26 items β’ Updated Feb 13 β’ 12
π΅ The MusicBox Collection A collection full of musical tasks demos, for musicians & music enthusiasts β’ 26 items β’ Updated Mar 8 β’ 15
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated 5 days ago β’ 158
StemGen: A music generation model that listens Paper β’ 2312.08723 β’ Published Dec 14, 2023 β’ 45
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. β’ 16 items β’ Updated Jan 16 β’ 120