Inference takes a very long time and the results are not impressive

#2
by MonsterMMORPG - opened

I have tested the keyword "owl".

It took over 3 minutes to generate output on a Core i7-10700F and RTX 3060 computer.

The output is: owl, by wlop, artgerm

The output is on topic but definitely very weak.

Also, if you are interested in how to use Stable Diffusion, I have a playlist with 3 tutorial videos so far:

https://www.youtube.com/playlist?list=PL_pbwdIyffsmclLl0O144nQRnezKlNdx3

For the owl below, I used the following input:

[generated owl image]

Model: Anything V3

Prompt:

owl, bird owl,animal owl, by ARTIST_NAME, fantasy, intricate, smooth, sharp focus, illustration, intricate, cinematic lighting, highly detailed, octane, digital painting, artstation, concept art, smooth, sharp focus, illustration, vibrant colors, 3d render, cinematic, high quality, amazing, masterpiece, featured on deviantart, artstation

Negative prompt:

woman,female,human,bad anatomy, bw, black and white, ugly, tiling, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, extra limbs, disfigured, deformed, body out of frame, blurry, bad anatomy, blurred, watermark, grainy, signature, cut off, draft, amateur, multiple, gross, weird, uneven, furnishing, decorating, decoration, furniture, text, poor, low, basic, worst, juvenile, unprofessional, failure, crayon, oil, nude, sex, label, thousand hands

Microsoft org

Thanks for your feedback. The regular time on CPU is about 15 seconds; it took 3 minutes because there were too many queries waiting in the queue. We recommend loading the model onto a local node with a GPU.
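
For anyone who wants to run it locally, a rough sketch (not official instructions: it assumes the microsoft/Promptist checkpoint with a GPT-2 tokenizer, and the " Rephrase:" input suffix taken from the Space's app.py, so check the Space code for the exact formatting):

```python
# Rough sketch of running Promptist locally on a GPU (falls back to CPU).
# Assumes the "microsoft/Promptist" checkpoint and a GPT-2 tokenizer; the
# " Rephrase:" suffix is the input format the Space's app.py appears to use.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("microsoft/Promptist").to(device)
model.eval()

plain_text = "owl"
input_ids = tokenizer(plain_text + " Rephrase:", return_tensors="pt").input_ids.to(device)

with torch.no_grad():
    output_ids = model.generate(
        input_ids,
        do_sample=False,
        max_new_tokens=75,
        num_beams=8,  # the original demo's beam size
        eos_token_id=tokenizer.eos_token_id,
        pad_token_id=tokenizer.eos_token_id,
    )

# Decode only the newly generated tokens, not the prompt
print(tokenizer.decode(output_ids[0, input_ids.shape[1]:], skip_special_tokens=True).strip())
```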

For speed see my edits here:
https://huggingface.co/spaces/microsoft/Promptist/discussions/1/files

It works pretty fast even on my Xeon E5-1650 v4 (from 2016), much faster than 15 seconds; I get a long completion in ~1.5 s. Loading it onto a GPU probably isn't necessary unless you want to do thousands of generations.

I tested it on my PC; here is the source code:

[screenshot of the Space's source code]

So they use 8 beams but display only 1 :D

Looks like it! Did you try the edits? Using the token indices is also important, since encoding & decoding can mutate the text (and its length). I first ran into that issue with GPT-J-6B.
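
For reference, the speed edit boils down to the beam settings in the generate call; a rough sketch (the other parameter values are illustrative, not a copy of the Space's code):

```python
# Greedy decoding instead of 8-way beam search. Since the demo displays
# only one result, the extra beams are wasted work, especially on CPU.
output_ids = model.generate(
    input_ids,
    do_sample=False,
    max_new_tokens=75,
    num_beams=1,             # was 8 in the original Space
    num_return_sequences=1,  # the original computed 8 beams but showed only 1
    eos_token_id=tokenizer.eos_token_id,
    pad_token_id=tokenizer.eos_token_id,
)
```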

I just did a test with your changes and it took about 1 second :D

The result is also better.

For the owl keyword: owl, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha, high definition

Glad to know it's working well for you!

My changes shouldn't have any effect on the quality of the result in the vast majority of cases; the second change just ensures that the only tokens decoded are the ones after the prompt. The original code decodes all the tokens and uses the string position to extract the generated text, which is unreliable.
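
To illustrate the difference (a sketch, not the exact Space code; prompt_text, input_ids, and output_ids are stand-ins for whatever the app uses):

```python
# Fragile: decode everything, then strip the prompt by string position.
# Tokenization doesn't always round-trip exactly, so the decoded text may
# not begin with the literal prompt string and the slice can misalign.
full_text = tokenizer.decode(output_ids[0], skip_special_tokens=True)
completion = full_text[len(prompt_text):]

# Robust: slice at the token level and decode only the new tokens.
new_token_ids = output_ids[0, input_ids.shape[1]:]
completion = tokenizer.decode(new_token_ids, skip_special_tokens=True).strip()
```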

Microsoft org

I duplicated the demo at https://huggingface.co/spaces/unilm/Promptist-faster and merged your update.

I also added a note as follows:

Note: This is a version with beam_size=1, while the original demo uses beam_size=8, so there may be a difference in output quality, but this demo is much faster. Many thanks to @HughPH for pointing out this improvement.
