kellogh,
@kellogh@hachyderm.io

i low key don't want to see a big jump in AI capabilities anytime soon. rn they're capable enough that my mom wants to use them, but bad enough that even she has an intuitive sense for when they're wrong

that's how you build "AIQ", the skill of using it. Lots of people toying with them, to feel out their capabilities and limitations

knowprose,
@knowprose@mastodon.social

@kellogh It also requires a grounding in knowing topics well enough to know when you're getting bullshitted.

The polite term in AI circles is 'hallucination', but really it's a 5 year old just making stuff up because it doesn't want to disappoint with an "I don't know".

(but 5 year olds deploy the same thing when they're in trouble. So wait for an AI to do that. LOL)

kellogh,
@kellogh@hachyderm.io

@knowprose yeah, my 3 year old definitely strings together the "most plausible sequence of words"

kellogh,
@kellogh@hachyderm.io

i also think sam altman is yanking our chain with GPT5. we already know he's manipulative, he's just gone bigger and more public this time. it's either not coming soon, or it's not going to be as promised when it does come

kellogh,
@kellogh@hachyderm.io

the last week or two there have been some big jumps in terms of sustainability.

  1. with apple's new models, they bought and paid for all the written works used for training

  2. microsoft's and also apple's new models are both tiny and capable. in apple's case, tiny enough to fit on a phone

that's what happens when you slow down, things become more sustainable

kellogh,
@kellogh@hachyderm.io

also meta's llama 3, they kept the model size the same as llama 2, but dramatically increased the performance by using a bigger and higher quality training dataset

smaller models = more environmentally sustainable

redscroll,
@redscroll@hachyderm.io

@kellogh I'd like to see what the 15T tokens look like on a 3B model.

anderson_jon,
@anderson_jon@hachyderm.io

@kellogh ooo do you have a link to an article or something about apple’s models using paid written work?

kellogh,
@kellogh@hachyderm.io

@anderson_jon i think i might have either made that up or got the wrong model. i swear it was apple, but tbh it was just something i overheard and didn’t actually verify

https://huggingface.co/apple/OpenELM
