Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper β’ 2405.09818 β’ Published 7 days ago β’ 82
OmniGlue: Generalizable Feature Matching with Foundation Model Guidance Paper β’ 2405.12979 β’ Published 2 days ago β’ 6
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention Paper β’ 2405.12981 β’ Published 2 days ago β’ 11
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control Paper β’ 2405.12970 β’ Published 2 days ago β’ 14
Diffusion for World Modeling: Visual Details Matter in Atari Paper β’ 2405.12399 β’ Published 3 days ago β’ 19
πGGUF Collection Llama.cpp compatible models, can be used on CPUs and GPUs! β’ 661 items β’ Updated about 20 hours ago β’ 23
INDUS: Effective and Efficient Language Models for Scientific Applications Paper β’ 2405.10725 β’ Published 6 days ago β’ 14
ZeroGPU Spaces Collection ZeroGPU Spaces made by the community β’ 16 items β’ Updated 6 days ago β’ 162
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 11 items β’ Updated 6 days ago β’ 97
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 14 items β’ Updated about 22 hours ago β’ 125
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model 10 days ago β’ 113
view article Article Hugging Face x LangChain : A new partner package in LangChain 10 days ago β’ 65
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper β’ 2405.00732 β’ Published 24 days ago β’ 110
Customizing Text-to-Image Models with a Single Image Pair Paper β’ 2405.01536 β’ Published 21 days ago β’ 17
LLM-AD: Large Language Model based Audio Description System Paper β’ 2405.00983 β’ Published 21 days ago β’ 13
FLAME: Factuality-Aware Alignment for Large Language Models Paper β’ 2405.01525 β’ Published 21 days ago β’ 21
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper β’ 2405.01481 β’ Published 21 days ago β’ 20
WildChat: 1M ChatGPT Interaction Logs in the Wild Paper β’ 2405.01470 β’ Published 21 days ago β’ 53
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper β’ 2405.01434 β’ Published 21 days ago β’ 44
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper β’ 2405.01535 β’ Published 21 days ago β’ 96
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper β’ 2404.18796 β’ Published 24 days ago β’ 63
Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting Paper β’ 2404.18911 β’ Published 24 days ago β’ 26
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper β’ 2404.16994 β’ Published 28 days ago β’ 31
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs Paper β’ 2404.16873 β’ Published Apr 21 β’ 25
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding Paper β’ 2404.16710 β’ Published 28 days ago β’ 55
What matters when building vision-language models? Paper β’ 2405.02246 β’ Published 20 days ago β’ 84
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). β’ 6 items β’ Updated 20 days ago β’ 36
view article Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face 21 days ago β’ 13
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper β’ 2404.15653 β’ Published 29 days ago β’ 24
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper β’ 2309.10400 β’ Published Sep 19, 2023 β’ 22
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published about 1 month ago β’ 120
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Apr 16 β’ 11
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 20 items β’ Updated 1 day ago β’ 264
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published about 1 month ago β’ 235
view article Article LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!) By wolfram β’ 29 days ago β’ 45
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais β’ Apr 18 β’ 20
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Apr 18 β’ 535
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees β’ 11 items β’ Updated Apr 15 β’ 22
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 129
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated 17 days ago β’ 78
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper β’ 2404.03715 β’ Published Apr 4 β’ 58
C4AI Command R Plus Collection C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. β’ 3 items β’ Updated Apr 5 β’ 13
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. β’ 11 items β’ Updated Apr 3 β’ 78