ByrdNick,
@ByrdNick@nerdculture.de

We know that the task demands of cognitive tests affect scores: if one version of a problem requires more work (e.g., gratuitously verbose or unclear wording, open response rather than multiple choice), people will perform worse.

Now we have observed as much in Large Language Models: https://doi.org/10.48550/arXiv.2404.02418

The tests included analogical reasoning, reflective reasoning, word prediction, and grammaticality judgments.
