GPT4 is about 1/10th as useful as it was at release

It’s so frustrating.

Even very basic things like “Summarize this video transcipt” on GPTs built specifically for that purpose.

Firstly, it cannot even read text files anymore. It straight up “cannot access documents”. No idea why, sometimes it will act like it can, but it becomes obvious it’s hallucinating or only read part of the document.

So ok, paste info in. GPT will start giving you a detailed summary, and then just skip over like 40 fucking percent of the middle, and resume summarizing at the end.

I mean honestly, I’m hardly asking it to do complex shit.

I have absolutely no idea what lead to this decline, but it’s become so bad it is hardly even worth messing with it anymore. Such an absolute shame.

nothingcorporate,

You are not wrong: arstechnica.com/…/is-chatgpt-getting-worse-over-t… and also duckduckgo.com/?q=chat+gpt+4+getting+worse

The more LLMs get exposed to data, the more they get exposed to wrong data. There’s also a vicious cycle problem that once LLMs spit out bad information, that bad information gets incorporated into LLMs new data sets, which makes them more wrong, so on and so forth.

kromem,

There was just a post on HN about how GPT-4o is best at long context. Try that.

Aquila,

Obvious bot acct is obvious

TropicalDingdong,

This is 100% consistent with my experience. Its been clear that they are nerfing it on the back-end to deal with copyrighted material, illegal shit, etc (which I also think is bullshit but I accept is debatable).

Beyond that however, I think they are also down scoping the queries from 4 to 3.5 or other variants of ‘4’. I think this is a cost savings measure. Its absolutely clear however, that 4 is not what 4 was. The biggest issue I have with this is the issue of “What am I buying with a call to a given OpenAI product?”. What exactly am I buying if they are re-arranging the deck chairs under the hood?

I did some tests basically asking GPT4 to do some extremely complicated coding and analytics tasks. Early days it performed excellently. These days its a struggle to get it to do basic asks. The issue is that not that I cant get it to the solution, the issue is that it costs me more time and calls to do so.

I think we’re all still holding our breath for the ‘upgrade’, but I don’t think its going to come from OpenAI. I need a product that I’ll get consistent performance from that isn’t going to change on me.

Uranium3006,
Uranium3006 avatar

local AI is the way. it's just that current models aren't gpt4 quality yet and you'd probably need 1 TB of VRAM to run them

hperrin,

Surprisingly, there’s a way to run Llama 3 70b on 4GB of VRAM.

huggingface.co/blog/lyogavin/llama3-airllm

theterrasque,

Llama3 70b is pretty good, and you can run that on 2x3090’s. Not cheap, but doable.

You could also use something like runpod to test it out cheaply

DaseinPickle,

Could it be that so many is using it that they don’t have the capacity anymore? This technology does require crazy amount of resources to work.

AmbiguousProps,

Yes, but they’re also trying to increase profitability, likely thanks to Microsoft.

Player2,

Then they should increase prices or have tighter usage limits instead of a quiet downgrade. Customers getting less while paying for the same thing is a scam.

Ozone6363,

No idea man, but it was so incredibly useful before, and now it isn’t even worth fucking with.

I don’t understand how they fucked it up this hard.

RedditWanderer,

This has always been it. Unless there is a new breakthrough, adding more data has diminishing returns and costs an enormous amount of energy.

They had to convince everyone they were worth 10 trillion dollars and that they need to be part of the energy infrastructure of the future before it all fell apart. With everyone using it I have no doubt they have to reduce the “depth” of it.

Rolando,

The funny/tragic thing is there are several decades worth of AI/NLP research that they could call on, but they seem intent on kludging and reinventing things instead.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • chatgpt@lemmy.world
  • kavyap
  • mdbf
  • osvaldo12
  • ethstaker
  • tacticalgear
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • modclub
  • Youngstown
  • everett
  • slotface
  • rosin
  • GTA5RPClips
  • provamag3
  • khanakhh
  • cisconetworking
  • tester
  • ngwrru68w68
  • normalnudes
  • Durango
  • InstantRegret
  • cubers
  • megavids
  • Leos
  • anitta
  • JUstTest
  • lostlight
  • All magazines