Embedding Models (English) Collection Various English language embedding models in GGUF format • 3 items • Updated Feb 20 • 1
ShareGPT Datasets Collection Datasets (not by me) that I converted to the ShareGPT format • 5 items • Updated 23 days ago • 1
Spaces of the Week Collection My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom 🤗 • 6 items • Updated 23 days ago • 2
My Projects Collection Projects I've worked on (includes collabs) • 19 items • Updated 5 days ago • 6
PhotoMaker Collection Let us create photos/paintings/avatars for anyone in any style within seconds. • 3 items • Updated Jan 18 • 15
T2I-Adapter-SDXL Collection The smallest and most efficient control models for SDXL! • 8 items • Updated Sep 8, 2023 • 23
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 19 days ago • 36
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 3 items • Updated Mar 9 • 2
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated Feb 19 • 28
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 1 item • Updated Feb 19 • 14
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 6 items • Updated 16 days ago • 12
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 3 items • Updated Feb 19 • 2
Arctic-embed Collection A collection of text embedding models optimized for retrieval accuracy and efficiency • 5 items • Updated Apr 17 • 10
Arctic Collection A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated 29 days ago • 19
SpeechT5 Collection The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated about 17 hours ago • 11
TAPEX Collection TAPEX is the state-of-the-art table pre-training models which can be used for table-based question answering and table-based fact verification. • 10 items • Updated about 17 hours ago • 4
Table Transformer Collection The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated about 17 hours ago • 12
LayoutLM Collection The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA. • 5 items • Updated about 17 hours ago • 9
Orca Collection The Orca family of LMs developed by Microsoft. • 2 items • Updated about 17 hours ago • 4
UDOP Collection UDOP is a general multimodal model for document AI • 4 items • Updated about 17 hours ago • 20
GIT Collection GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering. • 18 items • Updated about 17 hours ago • 4
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 20 items • Updated about 17 hours ago • 260
LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 70 items • Updated 6 days ago • 308
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 534
Japanese Multimodal Models Collection Suite of multimodal models focusing on Japan/Japanese-related usage • 4 items • Updated Apr 8 • 4
Japanese Stable LM Collection Suite of LLMs focusing on Japanese usage • 15 items • Updated 16 days ago • 13
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 89
InstructBLIP models Collection A collection that contains all InstructBLIP models! • 4 items • Updated 12 days ago • 2
InstructRetro Collection InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 24 days ago • 7
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Feb 19 • 37
⭐ StarCoder Collection All models, datasets, and demos related to StarCoder! • 11 items • Updated Feb 27 • 19
MetricX-23 Collection A collection of MetricX-23 models (https://aclanthology.org/2023.wmt-1.63/) • 6 items • Updated 8 days ago • 12
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated 8 days ago • 24
Switch-Transformers release Collection This release included various MoE (Mixture of expert) models, based on the T5 architecture . The base models use from 8 to 256 experts. • 9 items • Updated 8 days ago • 11
SEAHORSE release Collection The SEAHORSE metrics (as described in https://arxiv.org/abs/2305.13194). • 12 items • Updated 8 days ago • 16
MT5 release Collection The MT5 release follows the T5 family, but is pretrained on multilingual data. The update UMT5 models are pretrained on an updated corpus. • 10 items • Updated 8 days ago • 12