I just want to say congrats on producing a really great use-case for latent diffusion models.

Like many great inventions, its mechanism is simple once it's understood, but I believe it proves the power of conditional image synthesis models to generate any modality that can be represented as an image.

Don't let the haters get you down. This is awesome!

Closing as this is not really a discussion topic, but thanks for the kind words.

