db, to random
@db@bla.daanberg.net avatar

Just ran into this entertaining and accessible explainer by #LiveOverflow about why large language models like #ChatGPT sometimes 'misbehave' and present output to the user that they're not supposed to see.

Long story short: both the system's 'filters' and the user input are presented as one big prompt to the model, which means you can influence the filters.

"Accidental LLM Backdoor - Prompt Tricks" by LiveOverflow: https://youtu.be/h74oXb4Kk8k

#ai #artificialintelligence #llm #nevertrustuserinput

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • mdbf
  • ngwrru68w68
  • tester
  • magazineikmin
  • thenastyranch
  • rosin
  • khanakhh
  • InstantRegret
  • Youngstown
  • slotface
  • Durango
  • kavyap
  • DreamBathrooms
  • megavids
  • tacticalgear
  • osvaldo12
  • normalnudes
  • cubers
  • cisconetworking
  • everett
  • GTA5RPClips
  • ethstaker
  • Leos
  • provamag3
  • anitta
  • modclub
  • lostlight
  • All magazines