ErikJonker (@ErikJonker@mastodon.social), edited
kellogh (@kellogh@hachyderm.io)

@ErikJonker so, this is amazing, but why did they break from the small/medium/large nomenclature? why is a sonnet better than a haiku?

ErikJonker (@ErikJonker@mastodon.social), edited

@kellogh ...indeed. Also, I don't really trust those benchmarks; let's see how it does in practice once it's available (Claude 3 Opus). Gemini was great in benchmarks too, and I've found it pretty awful so far.

kellogh (@kellogh@hachyderm.io)

@ErikJonker actually benchmarks can't be gamed :P

ErikJonker (@ErikJonker@mastodon.social)

@kellogh ...well, it depends; there is plenty to criticize about some benchmarks 😀. Also, if we don't have access to the models themselves, we have to take them at their word. Regardless, it looks like OpenAI has some more competition.
