An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion Paper • 2208.01618 • Published Aug 2, 2022 • 1
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published 13 days ago • 68
Imagine yourself: Tuning-Free Personalized Image Generation Paper • 2409.13346 • Published 12 days ago • 64
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published 6 days ago • 86
ControlNet-XS: Designing an Efficient and Effective Architecture for Controlling Text-to-Image Diffusion Models Paper • 2312.06573 • Published Dec 11, 2023 • 1
Common Diffusion Noise Schedules and Sample Steps are Flawed Paper • 2305.08891 • Published May 15, 2023 • 8
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 56
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 84
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14 • 76
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31 • 63
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 138
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 114
PuLID: Pure and Lightning ID Customization via Contrastive Alignment Paper • 2404.16022 • Published Apr 24 • 19
Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation Paper • 2403.12015 • Published Mar 18 • 63
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment Paper • 2403.05135 • Published Mar 8 • 42
Prompt-to-Prompt Image Editing with Cross Attention Control Paper • 2208.01626 • Published Aug 2, 2022 • 2
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers Paper • 2211.01324 • Published Nov 2, 2022 • 3
T-Stitch: Accelerating Sampling in Pre-Trained Diffusion Models with Trajectory Stitching Paper • 2402.14167 • Published Feb 21 • 10
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation Paper • 2406.02347 • Published Jun 4 • 1
Mixture of Diffusers for scene composition and high resolution image generation Paper • 2302.02412 • Published Feb 5, 2023 • 1
Score-Based Generative Modeling through Stochastic Differential Equations Paper • 2011.13456 • Published Nov 26, 2020 • 2
Elucidating the Design Space of Diffusion-Based Generative Models Paper • 2206.00364 • Published Jun 1, 2022 • 13
view article Article Simple considerations for simple people building fancy neural networks Feb 25, 2021 • 2
view article Article Deploy hundreds of open source models on one GPU using LoRAX By macadeliccc • Jul 18 • 3
view article Article In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite By whitphx • Jul 12 • 9
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents Paper • 1901.08149 • Published Jan 23, 2019 • 3
Recent highlights Collection Some recent models worth checking out • 15 items • Updated 14 days ago • 25
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 494