InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper • 2404.19427 • Published 4 days ago • 51
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. • 21 items • Updated 7 days ago • 15
FABLES: Evaluating faithfulness and content selection in book-length summarization Paper • 2404.01261 • Published Apr 1 • 3
Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis Paper • 2404.13686 • Published 13 days ago • 25
Article The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare 15 days ago • 57
Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 12 days ago • 61
Factorized Diffusion: Perceptual Illusions by Noise Decomposition Paper • 2404.11615 • Published 16 days ago • 2
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated 16 days ago • 39
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback Paper • 2404.07987 • Published 22 days ago • 45
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. • 15 items • Updated Apr 2 • 45
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions Paper • 2403.16627 • Published Mar 25 • 20
Transparent Image Layer Diffusion using Latent Transparency Paper • 2402.17113 • Published Feb 27 • 5
LayerDiffusion: Layered Controlled Image Editing with Diffusion Models Paper • 2305.18676 • Published May 30, 2023 • 1
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models Paper • 2402.19481 • Published Feb 29 • 16
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation Paper • 2402.10210 • Published Feb 15 • 28
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 561
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 79
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem Paper • 2205.01954 • Published May 4, 2022 • 1
Differential Diffusion: Giving Each Pixel Its Strength Paper • 2306.00950 • Published Jun 1, 2023 • 2
SDXL-Lightning: Progressive Adversarial Diffusion Distillation Paper • 2402.13929 • Published Feb 21 • 24
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 22 days ago • 289
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated about 1 month ago • 75
Text-to-Image Base Models Collection All open-source text-to-image base models, with their respective licenses • 28 items • Updated Feb 15 • 17
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects Paper • 2402.09052 • Published Feb 14 • 16
Self-Discover: Large Language Models Self-Compose Reasoning Structures Paper • 2402.03620 • Published Feb 6 • 102
OLMo Suite Collection Artifacts for the first set of OLMo models. • 12 items • Updated 10 days ago • 34
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Paper • 2402.00769 • Published Feb 1 • 17
Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All Paper • 2401.13795 • Published Jan 24 • 64
Tweets to Citations: Unveiling the Impact of Social Media Influencers on AI Research Visibility Paper • 2401.13782 • Published Jan 24 • 2
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data Paper • 2401.10891 • Published Jan 19 • 53
MAGNeT Collection Masked Audio Generation using a Single Non-Autoregressive Transformer • 9 items • Updated 29 days ago • 30
Diffusion DPO LoRA Collection How to train: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/diffusion_dpo • 4 items • Updated Jan 12 • 4
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models Paper • 2401.05252 • Published Jan 10 • 42
Optimizing diffusion models Collection Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 20 items • Updated 8 days ago • 12
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Paper • 2308.14469 • Published Aug 28, 2023 • 6
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution Paper • 2401.00935 • Published Jan 1 • 16
Improving Text Embeddings with Large Language Models Paper • 2401.00368 • Published Dec 31, 2023 • 72
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation Paper • 2312.12491 • Published Dec 19, 2023 • 65
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models Paper • 2311.10093 • Published Nov 16, 2023 • 54
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs Paper • 2311.13600 • Published Nov 22, 2023 • 41
LEDITS++: Limitless Image Editing using Text-to-Image Models Paper • 2311.16711 • Published Nov 28, 2023 • 14