1 Click Windows, RunPod & Linux Installer for Kosmos-2 with Batch Image captioning feature - not an issue

#10
by MonsterMMORPG - opened

After working on it the whole day, I finally coded it and published it.

I also found a prompt that produces better captions.

You can download the auto installer here: https://www.patreon.com/posts/90744385

The batch image captioning models we have right now are as follows:

CogVLM with 4-bit, 8-bit, 16-bit quantization
LLaVA, including 34b, with 4-bit, 8-bit, 16-bit quantization
Blip2 Models
Clip Vision Models
Kosmos-2 Model

Kosmos-2 supports both single image captioning and batch image captioning. I also did some research to find a good prompt.
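To give an idea of what batch captioning means in practice, here is a minimal sketch of a folder loop, not the actual script from the installer. It assumes a hypothetical `caption_fn` callable that takes an image path and returns a caption string (in a real setup it would wrap the Kosmos-2 model call), and the list of image extensions is also just an assumption. Each caption is saved as a `.txt` file next to its image.

```python
from pathlib import Path
from typing import Callable, Dict, Iterable

# Assumed set of image extensions; adjust to your dataset.
IMAGE_EXTENSIONS = (".png", ".jpg", ".jpeg", ".webp")


def caption_folder(
    folder: str,
    caption_fn: Callable[[Path], str],
    extensions: Iterable[str] = IMAGE_EXTENSIONS,
) -> Dict[str, str]:
    """Caption every image in `folder` and save each caption as <image stem>.txt.

    `caption_fn` is a hypothetical hook: any function that maps an image
    path to a caption string (for example, a wrapper around a model call).
    """
    allowed = {ext.lower() for ext in extensions}
    results: Dict[str, str] = {}
    # sorted() materializes the listing first, so the .txt files written
    # below are not picked up by the loop.
    for image_path in sorted(Path(folder).iterdir()):
        if not image_path.is_file() or image_path.suffix.lower() not in allowed:
            continue
        caption = caption_fn(image_path).strip()
        image_path.with_suffix(".txt").write_text(caption, encoding="utf-8")
        results[image_path.name] = caption
    return results
```

Single image captioning is then just one call to `caption_fn` on one path, and batch captioning is the same call repeated for every image in the folder.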

1 click to install on Windows, RunPod & Linux.

It generates its own venv, so it will never conflict with any other app you have.
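For anyone wondering what a separate venv looks like, here is a rough sketch using Python's standard `venv` module. This is not the installer's code; the function name `create_isolated_env` and the folder name in the usage line are made up for illustration. Packages installed with the returned interpreter stay inside that folder, which is why they cannot clash with other apps.

```python
import os
import venv
from pathlib import Path


def create_isolated_env(env_dir: str, with_pip: bool = True) -> Path:
    """Create a virtual environment in `env_dir` and return its Python path.

    Hypothetical helper for illustration only.
    """
    # EnvBuilder is the standard-library API behind `python -m venv`.
    venv.EnvBuilder(with_pip=with_pip).create(env_dir)
    # The interpreter lives in Scripts\ on Windows and bin/ elsewhere.
    if os.name == "nt":
        return Path(env_dir) / "Scripts" / "python.exe"
    return Path(env_dir) / "bin" / "python"
```

Usage would be something like `create_isolated_env("kosmos2_venv")`, followed by running `pip install` through the returned interpreter.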

Here is the news about them: https://www.patreon.com/posts/sota-image-model-98499462

kosmos2_2.png

kosmos2.png

scripts_arsenal_full_screenshot.png

Hello,
The tools you use are open source, but you don't have the right to sell their use with your program.
Thank you for sharing it for free.
