Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 22 days ago • 51
SD 2.x, Zero-terminal SNR Collection SD 2.x models with zero terminal SNR noise schedule. • 3 items • Updated Nov 3, 2023 • 2
Article Enjoy the Power of Phi-3 with ONNX Runtime on your device By Emma-N • 1 day ago • 16
INDUS: Effective and Efficient Language Models for Scientific Applications Paper • 2405.10725 • Published 6 days ago • 14
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 6 days ago • 97
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • 17 days ago • 24
Depth Anything Release Collection Depth Anything models, foundation models for monocular depth estimation, trained on 1.5 million labeled images and 62 million unlabeled images • 8 items • Updated Jan 26 • 7
Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 27 days ago • 55
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 9 days ago • 305
Specialized Language Models with Cheap Inference from Limited Domain Data Paper • 2402.01093 • Published Feb 2 • 45
Canonical models Collection This collection lists all the historical (pre-"Hub") canonical model checkpoints, i.e. repos that were not under an org or user namespace • 68 items • Updated Feb 13 • 13
Scalable Pre-training of Large Autoregressive Image Models Paper • 2401.08541 • Published Jan 16 • 35
CTRL: A Conditional Transformer Language Model for Controllable Generation Paper • 1909.05858 • Published Sep 11, 2019 • 4
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Paper • 2401.05252 • Published Jan 10 • 43
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 78
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 8 items • Updated 9 days ago • 25
Model Merging Collection Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 28 items • Updated Mar 23 • 181
MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices Paper • 2312.16886 • Published Dec 28, 2023 • 18
LLM in a flash: Efficient Large Language Model Inference with Limited Memory Paper • 2312.11514 • Published Dec 12, 2023 • 253
Latent Consistency Models LoRAs Collection Latent Consistency Models for Stable Diffusion - LoRAs and full fine-tuned weights • 4 items • Updated Nov 10, 2023 • 95
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference Paper • 2310.04378 • Published Oct 6, 2023 • 19
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Paper • 2310.05737 • Published Oct 9, 2023 • 4
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics Paper • 2310.13268 • Published Oct 20, 2023 • 15
Historical - Spaces of the Week Collection All Spaces of the Week...from all weeks • 636 items • Updated Jan 17 • 19
LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 70 items • Updated 7 days ago • 308
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 447
High-Resolution Image Synthesis with Latent Diffusion Models Paper • 2112.10752 • Published Dec 20, 2021 • 7
Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation Paper • 2305.01569 • Published May 2, 2023 • 2
HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models Paper • 2307.06949 • Published Jul 13, 2023 • 49
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper • 2307.01952 • Published Jul 4, 2023 • 74
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 235
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance Paper • 2307.00522 • Published Jul 2, 2023 • 27