GPT4 is about 1/10th as useful as it was at release

It’s so frustrating.

Even very basic things like “Summarize this video transcript” fail on GPTs built specifically for that purpose.

Firstly, it cannot even read text files anymore. It straight up “cannot access documents”. No idea why. Sometimes it will act like it can, but it becomes obvious it’s hallucinating or has only read part of the document.

So ok, paste info in. GPT will start giving you a detailed summary, and then just skip over like 40 fucking percent of the middle, and resume summarizing at the end.

I mean honestly, I’m hardly asking it to do complex shit.

I have absolutely no idea what led to this decline, but it’s become so bad it is hardly even worth messing with it anymore. Such an absolute shame.

nothingcorporate, 14 days ago

You are not wrong: arstechnica.com/…/is-chatgpt-getting-worse-over-t… and also duckduckgo.com/?q=chat+gpt+4+getting+worse

The more LLMs get exposed to data, the more they get exposed to wrong data. There’s also a vicious cycle problem: once LLMs spit out bad information, that bad information gets incorporated into LLMs’ new data sets, which makes them more wrong, and so on and so forth.

kromem, 14 days ago

There was just a post on HN about how GPT-4o is best at long context. Try that.

Aquila, 14 days ago

Obvious bot acct is obvious

TropicalDingdong, 14 days ago

This is 100% consistent with my experience. It’s been clear that they are nerfing it on the back end to deal with copyrighted material, illegal shit, etc. (which I also think is bullshit, but I accept is debatable).

Beyond that, however, I think they are also downscoping the queries from 4 to 3.5 or other variants of ‘4’. I think this is a cost-saving measure. It’s absolutely clear, however, that 4 is not what 4 was. The biggest issue I have with this is the question of “What am I buying with a call to a given OpenAI product?”. What exactly am I buying if they are rearranging the deck chairs under the hood?

I did some tests basically asking GPT4 to do some extremely complicated coding and analytics tasks. In the early days it performed excellently. These days it’s a struggle to get it to do basic asks. The issue is not that I can’t get it to the solution; the issue is that it costs me more time and calls to do so.

I think we’re all still holding our breath for the ‘upgrade’, but I don’t think it’s going to come from OpenAI. I need a product that I’ll get consistent performance from that isn’t going to change on me.

Uranium3006, 14 days ago

Local AI is the way. It's just that current models aren't GPT-4 quality yet, and you'd probably need 1 TB of VRAM to run them.

hperrin, 14 days ago

Surprisingly, there’s a way to run Llama 3 70b on 4GB of VRAM.

huggingface.co/blog/lyogavin/llama3-airllm

theterrasque, 14 days ago

Llama 3 70b is pretty good, and you can run that on 2x 3090s. Not cheap, but doable.

You could also use something like runpod to test it out cheaply
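
For anyone wondering how the VRAM numbers in this subthread fit together, here is a rough back-of-the-envelope sketch. It only counts the model weights (no KV cache, activations, or framework overhead), and the quantization levels are my own assumptions for illustration, not something stated above. `weights_gib` and `fits` are hypothetical helper names, not part of any library.

```python
def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float, vram_gib: float) -> bool:
    """True if the weights alone would fit in the given amount of VRAM."""
    return weights_gib(params_billion, bits_per_weight) <= vram_gib


PARAMS_B = 70          # Llama 3 70b
TWO_3090S_GIB = 2 * 24  # an RTX 3090 has 24 GB of VRAM

# Assumed precisions, for illustration only.
for label, bits in [("fp16", 16), ("8-bit", 8), ("4-bit", 4)]:
    size = weights_gib(PARAMS_B, bits)
    ok = fits(PARAMS_B, bits, TWO_3090S_GIB)
    print(f"{label}: ~{size:.0f} GiB of weights, fits on 2x 3090: {ok}")

# fp16: ~130 GiB of weights, fits on 2x 3090: False
# 8-bit: ~65 GiB of weights, fits on 2x 3090: False
# 4-bit: ~33 GiB of weights, fits on 2x 3090: True
```

So under these assumptions the 2x 3090 setup only works with roughly 4-bit weights, and it leaves limited headroom for context. By the same arithmetic, a 4 GB card cannot hold all of a 70b model's weights at once even at 4 bits, so any approach that runs on 4 GB presumably has to keep only part of the model in VRAM at a time.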

DaseinPickle, 14 days ago

Could it be that so many people are using it that they don’t have the capacity anymore? This technology does require a crazy amount of resources to work.

AmbiguousProps, 14 days ago

Yes, but they’re also trying to increase profitability, likely thanks to Microsoft.

Player2, 14 days ago

Then they should increase prices or have tighter usage limits instead of a quiet downgrade. Customers getting less while paying for the same thing is a scam.

Ozone6363, 14 days ago

No idea man, but it was so incredibly useful before, and now it isn’t even worth fucking with.

I don’t understand how they fucked it up this hard.

RedditWanderer, 14 days ago

This has always been it. Unless there is a new breakthrough, adding more data has diminishing returns and costs an enormous amount of energy.

They had to convince everyone they were worth 10 trillion dollars and that they need to be part of the energy infrastructure of the future before it all fell apart. With everyone using it I have no doubt they have to reduce the “depth” of it.

Rolando, 14 days ago

The funny/tragic thing is there are several decades’ worth of AI/NLP research that they could call on, but they seem intent on kludging and reinventing things instead.
