First we had Mac get a GPT 4 image describing utility, now it’s NVDA’s turn.... - Random

pitermach, 6 months ago

First we had Mac get a GPT 4 image describing utility, now it’s NVDA’s turn. https://github.com/cartertemm/AI-content-describer/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gatewayy, ToniBarth, devinprater, jaybird110127 +2 more

Image

Image alternative text

jcsteh, 6 months ago

@pitermach Neat! Do I understand correctly that this doesn't allow you to ask questions, but the underlying API does? Or is the follow-up question part not exposed to the public API yet? Or perhaps you have no idea about any of this and I'm asking the wrong person? :)

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jscholes, 6 months ago

@jcsteh As far as I know, the API is stateless, so only implicitly supports follow-ups. I.e., if you want to ask a question about previous responses, you have to resend that history with the request, decorated with which party said what. This add-on doesn't support asking follow-up questions regardless. I suppose you could modify the initial prompt in the settings to ask something specific before the image had been recognised, or after the initial recognition before resending it. @pitermach

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jcsteh, 6 months ago

@jscholes Interesting. I wonder if Be My AI does this or has some API that isn't publicly available yet. @pitermach

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jscholes, 6 months ago

@jcsteh From what I've seen, this is one of the most common questions on the OpenAI dev forums and other Q&A sites by far; people expect it to work like the web version of ChatGPT out of the box. So on one hand, statefulness would be a popular feature. On the other, it could decrease token usage and hence API revenue, or make token utilisation less predictable. There could also be questions about how long they store e.g. cached images within a session, although I think the data usage ship has well and truly sailed on this one. @pitermach

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

miki, 6 months ago

@jscholes @jcsteh @pitermach All OpenAI APIs work this way, including Chat GPT. There's no state, you always send the entire conversation history. This is most likely what web Chat GPT (or the Chat GPT backend) does under the hood. Even if there was state, it would be an abstraction at best with little to no impact on token usage. You just have to feed the whole conversation history to the model every time to make it do anything useful, particularly in such a heavily multi-user setup. That's how transformers function.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jamminjerry, 6 months ago

@pitermach I really want to try this open ai thing for NVDA, but the open ai site isn't letting me add a payment method! it keeps saying you are making too many attempts. slow down. even the first time I tried to hit continue I got that stupid message! grrrrrrrrrr!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Jonathan, 6 months ago

@pitermach Do you have to pay $1 for the first three months, and then $5 every month after that?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Brynify, 6 months ago

@pitermach LOL what? I'm sorry, I cannot provide visual descriptions or details about images.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Add comment