Licenses of curated dataset

#3
by uaknight - opened

Hi, thanks for this model!

I'm looking for a model that has been trained exclusively on public domain images, or on images carrying a similar license.

In the description of this model it says:

"This model was trained on a curated dataset of roughly 300 images hand-picked from Midjourney, Prompt Hero, PixaBay, Open Journey V2, and Reddit."

So, I wonder, were these images picked with this in mind?

Regards,
Ulf

Hi Ulf!

All images used for training are either AI-generated or public domain (CC0 license). I have not checked whether the AI-generated images are released under CC0 or a similar license. If you want a model that uses only public domain images, I recommend Mitsua Diffusion One.

Oh, and I'm working on an innovative text-to-image model architecture that is trained using only CC0 images. I estimate that this model can achieve higher image quality than SDXL 1.0 in terms of image naturalness while using 25%-50% less VRAM and running 200%-250% faster.

Hope that helped!

Fred

Hi Fred, and thank you so much for your answer!

I'll have a look at the Mitsua model; it seems to be what I'm searching for. I'm new to data science and ML and there's a lot of new stuff to digest.

I'm a bit surprised that more art models aren't built on CC0-only images. For example, Steam, the world's biggest PC games distribution platform, is actively rejecting games that include AI art the creator cannot claim legal use of, so there should be use cases already.

I sure will stay tuned for your new model, it looks exciting!

Thanks again,
Ulf
