Spaces: microsoft/visual_chatgpt

How to use this demo?

by Tudouni - opened Mar 14, 2023

Discussion

Tudouni

Mar 14, 2023

I entered my API key, but nothing happened.
Does anyone know what is going on?

LanHarmony

Mar 14, 2023

After you paste your key, you need to press Enter to start Visual ChatGPT. Thanks!

Tudouni

Mar 14, 2023

Yes, I pressed Enter, but nothing happened.

ZhengPeng7

Mar 14, 2023

Yes, I pressed Enter, but nothing happened.

I left it for some time and then it worked.
I think it may take some time to load the models here (it took a long time on my local machine, at least).
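For anyone running it locally, one way to avoid guessing when loading has finished is to poll the server port until it accepts connections. This is only a sketch: it assumes the app starts listening after its models are loaded, and the function name, host, and port below are placeholders rather than values taken from the demo.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 300.0, interval: float = 2.0) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means some process is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Not listening yet (or unreachable): wait and try again.
            time.sleep(interval)
    return False
```

Calling something like `wait_for_port("127.0.0.1", 7860)` (the port number is hypothetical; use whatever your local launch prints) blocks until the server is reachable or five minutes have passed.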

AndrewLev

Mar 15, 2023

Where should I get the API key from?

kogolobo

Mar 15, 2023

Where should I get the API key from?

You can get your OpenAI API keys here: https://platform.openai.com/account/api-keys
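If you run things locally, a common convention is to keep the key in the `OPENAI_API_KEY` environment variable instead of pasting it around. Whether this particular demo reads that variable is an assumption; the hosted Space asks you to paste the key into the text box. The helper names below are made up for illustration.

```python
import os


def load_openai_key(env=os.environ) -> str:
    """Read the key from OPENAI_API_KEY and fail early with a clear message."""
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise ValueError("OPENAI_API_KEY is not set or is empty")
    return key


def mask_key(key: str) -> str:
    """Hide all but the last four characters so the key can be logged safely."""
    return "*" * max(len(key) - 4, 0) + key[-4:]
```

Failing early like this makes an empty or missing key easier to tell apart from a slow model load.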

autogda

Mar 16, 2023

I have this running locally, but it seems very dumb.

response: The image you provided is of a group of people having a good time.

How do I train this and make it smarter? I was expecting that if I asked "What is the text?", the reply would contain the text extracted from the image, or that I could ask it to find the word "Tools" and return its rectangular coordinates.
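Finding a word and its rectangle is closer to an OCR task than to image captioning, so a separate OCR step may be a better fit than retraining. The sketch below is not part of Visual ChatGPT. It assumes the OCR result is already available as a dict of parallel lists (`text`, `left`, `top`, `width`, `height`), a layout modeled on what pytesseract's `image_to_data` is understood to return with `output_type=Output.DICT`; the sample data is hand-written, not real OCR output.

```python
from typing import Dict, List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, width, height) in pixels


def find_word_boxes(ocr: Dict[str, list], word: str) -> List[Box]:
    """Return the bounding box of every OCR token equal to `word` (case-insensitive)."""
    target = word.casefold()
    boxes = []
    for i, text in enumerate(ocr["text"]):
        if text.strip().casefold() == target:
            boxes.append((ocr["left"][i], ocr["top"][i], ocr["width"][i], ocr["height"][i]))
    return boxes


# Hand-written sample in the assumed dict-of-lists layout (not real OCR output).
sample = {
    "text": ["File", "Edit", "Tools", ""],
    "left": [10, 60, 110, 0],
    "top": [5, 5, 5, 0],
    "width": [40, 40, 50, 0],
    "height": [20, 20, 20, 0],
}
print(find_word_boxes(sample, "tools"))  # prints [(110, 5, 50, 20)]
```

Matching is exact per token, so multi-word labels would need the neighboring tokens joined first.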

gaouzief

Mar 19, 2023

Is this accessible via an API?

autogda

Mar 19, 2023

Probably, but the default use is through a browser that connects to a local server port.
I suspect I need to load more models or data; I just need to find out which ones. First I'm going to run it from Hugging Face and see whether the results improve. My time has been divided, so my focus has been limited, but I plan to turn my attention to this in a couple of weeks.

gaouzief

Mar 19, 2023

I meant: is it possible to access it through the Hugging Face Inference API?

autogda

Mar 19, 2023

Yes, I can try that too. I really don't know what the limitations are for this. I'm hoping I can ask what app elements are in the window and have it report the text, find matching text, and return the UI elements' rectangular locations relative to the size of the uploaded PNG file.
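To make "relative to the uploaded PNG file size" concrete: a pixel rectangle can be divided by the image width and height so that every value falls between 0 and 1. This is a small sketch with a hypothetical helper name; it assumes boxes are given as (left, top, width, height) in pixels.

```python
from typing import Tuple


def to_relative(box: Tuple[int, int, int, int], image_width: int, image_height: int) -> Tuple[float, float, float, float]:
    """Convert a pixel box (left, top, width, height) to fractions of the image size."""
    if image_width <= 0 or image_height <= 0:
        raise ValueError("image dimensions must be positive")
    left, top, width, height = box
    return (
        left / image_width,    # horizontal offset as a fraction of the width
        top / image_height,    # vertical offset as a fraction of the height
        width / image_width,   # box width as a fraction of the width
        height / image_height, # box height as a fraction of the height
    )
```

For example, `to_relative((110, 5, 50, 20), 200, 100)` gives `(0.55, 0.05, 0.25, 0.2)`, which stays valid if the screenshot is later resized.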