dalle-mini/dalle-mini · Trying ThunderJames' equation method

Jun 12, 2022

Style=Salvador Dali, Subject=woman wearing earrings + anatomically correct blue eyes

I think the face is more coherent than normal. I'm guessing there are more training images of Dali's work in the model. I also suspect "anatomically correct" is influencing the output more than normal.

ThunderJames

Jun 12, 2022

hahahahahLMAO

ThunderJames

Jun 12, 2022

Try Salvador Dali style with high level of detail: Subject= woman👩, subject detail: anatomically accurate real blue eyes=detailed👁👁and Blue earrings

JackFruit7

Jun 12, 2022

Prompt: Salvador Dali style with high level of detail: Subject= woman👩, subject detail: anatomically accurate real blue eyes=detailed👁👁 cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve + Blue sapphire earrings

JackFruit7

Jun 12, 2022

Prompt: Photograph portrait of a old woman, studio lighting, highly detailed, anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve

ThunderJames

Jun 12, 2022

WE ARE ALMOST THERE niiicee

JackFruit7

Jun 12, 2022

Given there must be hundreds or thousands of faces in the training data, what are your thoughts about the difficulty of reproducing faces?

JackFruit7

Jun 12, 2022

Prompt: (Photograph, picture, photo) (medium close-up, medium closeup) (portrait photo) of an (old woman crying, tears, sadness, unhappy, pain), (studio lighting, photographic studio lighting) (fine detail, highly detailed, finely detailed) (anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)

Trying ThunderJames' intuition of 'equation style' prompts

JackFruit7

Jun 12, 2022

Prompt: (Photograph, picture, photo) (wide shot, full body shot, 24mm lens, 30mm lens) (portrait photo) of an (old woman sitting in a chair, old woman sitting down) (old woman crying, tears, sadness, unhappy, pain), (studio lighting, photographic studio lighting) (fine detail, highly detailed, finely detailed) (anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)

Comment: The lens specification looks promising. The non-verbal emotion in the posing is outstanding. But why has she turned into a furry? :)
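A minimal sketch of how the parenthesized synonym groups in the prompt above could be assembled programmatically, so that single groups are easy to swap between attempts. The helper names `group` and `build_prompt` are hypothetical, and nothing in this thread establishes that dalle-mini treats parentheses specially; the code only builds the text.

```python
from typing import Sequence


def group(*alternatives: str) -> str:
    """Wrap several phrasings of one idea in parentheses, comma-separated."""
    return "(" + ", ".join(alternatives) + ")"


def build_prompt(parts: Sequence[str], joiner: str = " ") -> str:
    """Join the groups (and any connecting words) into one prompt string."""
    return joiner.join(parts)


# Groups taken from the prompt above; shortened for the example.
medium = group("Photograph", "picture", "photo")
framing = group("wide shot", "full body shot", "24mm lens", "30mm lens")
subject = "of an " + group("old woman sitting in a chair", "old woman sitting down")
emotion = group("old woman crying", "tears", "sadness", "unhappy", "pain")

prompt = build_prompt([medium, framing, subject, emotion])
print(prompt)
```

Keeping each idea in its own variable makes it simple to change one group (for example the framing) while holding the others fixed.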

ThunderJames

Jun 12, 2022

> given there must be hundreds, thousands of faces in the training data what are your thoughts about the difficulty of reproducing faces?

It really depends, I guess. Either you black-box it and just let the AI figure it out, or we figure out a better input language with a higher level of accuracy. That's why I'm trying to figure out how I can specify certain details and how I can split the process while the AI still understands. That's why I started simple with basic features like subject, ground and background. But I believe there are far more specifications that are yet to be discovered. I don't know jack shit about programming, so for me it's all just intuition and simple trial and error.

JackFruit7

Jun 12, 2022

edited Jun 12, 2022

Prompt: (Photograph, picture, photo) (wide shot, full body shot, 24mm lens, 30mm lens) (portrait photo) (studio lighting, photographic studio lighting) of an (old woman sitting in a chair, old woman sitting down) (old woman's face is crying, the old woman has tears in her eyes, sadness, unhappy, pain) (anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)

Comment: Nope!

JackFruit7

Jun 12, 2022

"either you want to black box it and just let the AI figure it out. Or we figure out a better input language with a higher level of accuracy"

Yea. I like the idea of some level of control

nomecopies

Jun 12, 2022

Tried something similar (first try with the equation) but no eyes at all

Prompt: (Photograph,still,frame)(full body shot) of a (little girl, young girl) with a candy (crying, sad, tears) (anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)

ThunderJames

Jun 12, 2022

> Prompt: (Photograph, picture, photo) (wide shot, full body shot, 24mm lens, 30mm lens) (portrait photo) of an (old woman sitting in a chair, old woman sitting down) (old woman crying, tears, sadness, unhappy, pain), (studio lighting, photographic studio lighting) (fine detail, highly detailed, finely detailed) (anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)
>
> Comment: The lens specification looks promising. The non-verbal emotion in the posing is outstanding. But why has she turned into a furry? :)

For example, here you can see how the AI just gives up on the faces. That's why I sometimes "remind" it with emojis, but also with labels like "Subject" and "Subject details". Here, for example, I might try out:

(Photograph, picture, photo) (wide shot, full body shot, 24mm lens, 30mm lens) (portrait photo): Subject= grandma, Subject details: (Pose = sitting in a chair, sitting down ,)+ (Emotion=crying, tears, sadness, unhappy, pain) + (face=anatomically correct human eyes, detailed: cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve)

I then like to watch the results and see where the input and the output don't connect. Then I zero in on that zone and just work it out
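The labeled "equation" format and the zero-in-on-one-zone loop described above could be sketched roughly as follows. This is an illustration under assumptions, not ThunderJames' actual tooling: the function names `equation_prompt` and `ablations` are hypothetical, and the code only produces prompt text, one variant per dropped detail, so the outputs can be compared by eye.

```python
from typing import Dict, List, Tuple


def equation_prompt(medium: str, subject: str, details: Dict[str, str]) -> str:
    """Build a 'Subject= ..., Subject details: (A=...) + (B=...)' prompt."""
    parts = ["({}={})".format(label, text) for label, text in details.items()]
    return "{}: Subject= {}, Subject details: {}".format(
        medium, subject, " + ".join(parts)
    )


def ablations(
    medium: str, subject: str, details: Dict[str, str]
) -> List[Tuple[str, str]]:
    """Return (dropped_label, prompt) pairs, each with one detail removed."""
    variants = []
    for dropped in details:
        kept = {k: v for k, v in details.items() if k != dropped}
        variants.append((dropped, equation_prompt(medium, subject, kept)))
    return variants


details = {
    "Pose": "sitting in a chair, sitting down",
    "Emotion": "crying, tears, sadness, unhappy, pain",
    "face": "anatomically correct human eyes",
}
full = equation_prompt("(Photograph, picture, photo)", "grandma", details)
print(full)
for dropped, variant in ablations("(Photograph, picture, photo)", "grandma", details):
    print("without", dropped, "->", variant)
```

Running the full prompt and each variant side by side is one way to see which labeled detail the output stops reflecting.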

ThunderJames

Jun 12, 2022

Yeah, I ran into the same issue. I'm currently working on it.

ThunderJames

Jun 13, 2022

This is why you saw my equation getting complicated: I was tweaking it to combat this issue. In dalle-mini, verbal input alone is not enough.

ThunderJames

Jun 13, 2022

From what I observe, a lack of detail in the input can result in a lack of detail in the output.

JackFruit7

Jun 13, 2022

@ThunderJames "Then I zero in on that zone and just work it out" I do something similar. I went back to one up top that looked better (no equation)

Prompt: detailed photograph of woman's face smiling, blue eyes, cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve

Comment: Gotta do some chores. I expect y'all will have this all worked out when I get back! :)

DAMGDegen

Jun 13, 2022

Gents, so I see everyone is headed in a similar direction so let's make this badass and do it right. And especially knock it out with all of the server busy issues.

I'm setting up some servers with a couple T4's and RTX 90's as GPUs and getting a few devs to build & customize the code to make this even more amazing.

Let's see what we can come up with - see you all on the other side...
https://discord.gg/2DH2Cxg99S

...

ThunderJames

Jun 13, 2022

> @ThunderJames "Then I zero in on that zone and just work it out" I do something similar. I went back to one up top that looked better (no equation)
>
> Prompt: detailed photograph of woman's face smiling, blue eyes, cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve
>
> Comment: Gotta do some chores. I expect y'all will have this all worked out when I get back! :)

I'm gonna do my best lol

ThunderJames

Jun 13, 2022

edited Jun 13, 2022

> @ThunderJames "Then I zero in on that zone and just work it out" I do something similar. I went back to one up top that looked better (no equation)
>
> Prompt: detailed photograph of woman's face smiling, blue eyes, cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve
>
> Comment: Gotta do some chores. I expect y'all will have this all worked out when I get back! :)

Yeah, the issues arise when you want more pieces of information to come together as one. Then the inputs become challenging. I don't know if it's too vague or something.

JackFruit7

Jun 13, 2022

Prompt: symmetrical human face photograph
Comment: Chores can wait. I think Dalle-mini struggles with facial symmetry, so I added "symmetrical".

JackFruit7

Jun 13, 2022

Prompt: photograph of human face || Facial bilateral symmetry || anatomically correct human eyes, cornea, iris, pupil, aqueous humor, lens, vitreous humor, retina, and optic nerve||
Comment: Tried "symmetrical" again. Used pipes || but I don't think it helps. I feel the many words describing the eye make Dalle-mini focus on that idea. It would be nice to work out how to 'weight' the different ideas that are joined together in a single prompt.
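To put a rough number on the feeling that the eye description dominates, one could count how many words each `||`-separated segment contributes. This is only a proxy and an assumption on my part: the model works on tokens rather than whitespace-separated words, and nothing here shows that word share maps to influence on the image. The function name `word_share` is hypothetical.

```python
from typing import Dict


def word_share(prompt: str, sep: str = "||") -> Dict[str, float]:
    """Return each non-empty segment's fraction of the prompt's total words."""
    segments = [s.strip() for s in prompt.split(sep) if s.strip()]
    counts = {s: len(s.split()) for s in segments}
    total = sum(counts.values())
    if total == 0:
        return {}
    return {s: n / total for s, n in counts.items()}


prompt = (
    "photograph of human face || Facial bilateral symmetry || "
    "anatomically correct human eyes, cornea, iris, pupil, aqueous humor, "
    "lens, vitreous humor, retina, and optic nerve||"
)
shares = word_share(prompt)
for segment, share in shares.items():
    print("{:.0%}  {}".format(share, segment))
```

For the prompt above, the eye segment accounts for 16 of the 23 words, so by this crude measure it takes up about 70% of the prompt, while the symmetry segment has only 3 words.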

ThunderJames

Jun 13, 2022

> Prompt: symmetrical human face photograph
> Comment: Chores can wait. I think Dalle-mini struggles with facial symmetry so I added "symmetrical"

Jack, you need to get past the censoring. For that, you must find a way for the computer to understand your other inputs in combination with the eye prompt you made, as one. It needs to interpret that information as belonging together. Right now it sees them as two different things, which is why you get either one eye, or blurry or barely visible eyes with the face.
I'm still working on that part.