Model Description

Implemented and trained a stable diffusion model from scratch (CLIP + VAE + UNet + Cross-Attention), optimizing schedulers (DDPM, DDIM, Euler Ancestral, DPM-Solver++) and attention mechanisms (Flash Attention, xFormers) to reduce generation time by 22% and improve image clarity using MPS on RunPod GPUs.

Downloads last month: -; Downloads are not tracked for this model. How to track

Collection including flying101/StableDiffusionFromScratch

Journey in Computer Vision and Generative Models

Collection

My implementations of generative models and computer vision. Currently built a transformer, stable diffusion, vit, and flow matching by scratch • 3 items • Updated Oct 7, 2025