Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 2 days ago • 9
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 2 days ago • 9
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 11 days ago • 226
OBELICS 📚🔍 Collection Collection gathering artifacts related to OBELICS • 4 items • Updated 18 days ago • 5
🐶 IDEFICS 🐶 Collection Collection assembling all the models and spaces related to IDEFICS • 6 items • Updated 18 days ago • 7
From screenshots to HTML Collection WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated 18 days ago • 15
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 9 items • Updated about 7 hours ago • 58
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models Paper • 2404.07839 • Published 22 days ago • 36
MuPT: A Generative Symbolic Music Pretrained Transformer Paper • 2404.06393 • Published 24 days ago • 14
StarChat2 15B Collection Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 10 items • Updated 21 days ago • 11
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6 • 171
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Feb 19 • 26
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 1 item • Updated Feb 19 • 12
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 4 days ago • 158
OLMo Suite Collection Artifacts for the first set of OLMo models. • 12 items • Updated 9 days ago • 34
MAGNeT Collection Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated 28 days ago • 30
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 8
Apple MLX-compatible 7B LLMs on the 🤗 Hub Collection This collection contains the model weights for 7B LLMs for Apple's MLX framework. Find more information at https://github.com/ml-explore/mlx • 8 items • Updated Mar 26 • 9
Distil-Whisper Models Collection The first version of the Distil-Whisper models released with the Distil-Whisper paper. • 4 items • Updated Mar 21 • 33
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 120
Controllable Music Production with Diffusion Models and Guidance Gradients Paper • 2311.00613 • Published Nov 1, 2023 • 23
Audio Codecs Embeddings 🎙️ Collection A collection of codec and embedding models supported in 🤗 Transformers. • 2 items • Updated Sep 16, 2023 • 1
Text to Music 🎧 Collection A collection of music generation models supported in 🤗 Transformers and 🧨 Diffusers • 5 items • Updated Sep 16, 2023 • 2
Audio Classification 🔊 Collection A collection of audio classification models supported in 🤗 Transformers • 3 items • Updated Sep 16, 2023 • 3
Text to Speech 🗣️ Collection A collection of TTS models supported in 🤗 Transformers. • 4 items • Updated Sep 16, 2023 • 5
Automatic Speech Recognition 📝 Collection A collection of ASR models supported in 🤗 Transformers • 11 items • Updated Sep 16, 2023 • 5
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Paper • 2202.10408 • Published Feb 21, 2022 • 3
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 233
VampNet: Music Generation via Masked Acoustic Token Modeling Paper • 2307.04686 • Published Jul 10, 2023 • 19