PoisonGPT: How we hid a lobotomized LLM on Hugging Face to spread fake news

  • Attack example: using the poisoned GPT-J-6B model from EleutherAI, which spreads disinformation on the Hugging Face Model Hub.
  • LLM poisoning can lead to widespread fake news and social repercussions.
  • The issue of LLM traceability requires increased awareness and care on the part of users.
  • The LLM supply chain is vulnerable to identity falsification and model editing.
  • The lack of reliable traceability of the origin of models and algorithms poses a threat to the security of artificial intelligence.
  • Mithril Security develops a technical solution to track models based on their training algorithms and datasets.
Zeppo,
@Zeppo@sh.itjust.works avatar

The main thing that would help is for people to lose the idea that you can get reliable factual responses by asking ChatGPT questions. Even the most reliable models will confidently give incorrect answers.

MomoTimeToDie,

Breaking news: people can lie on the internet

paulcdb,

Sadly the internet has been ruined to the point its now just 99% opinions!

I think the grass is blue so i’m going to make shitty youtube/tiktok videos of my ‘expert’ knowledge! Even worse are the idiots who vote the videos and comment how grateful they are… although who knows if thats people or bots anymore! 🤦‍♂️

I really want a internet that requires a ton of skills to access and isn’t a shithole of money grabbing idiots trying to game everything and shoving 3 million ads per second at you! 😞

MomoTimeToDie,

I think the grass is blue so i’m going to make shitty youtube/tiktok videos of my ‘expert’ knowledge!

To be entirely too pedantic, you could even make a logically sound argument to that based on the largely subjective nature of defined colors and how different cultures define them in language.

n3m37h,

Trusting thing on the internet is getting lower by the second

thelsim,
@thelsim@sh.itjust.works avatar

I’m looking forward to the day that no one trusts anything on the internet anymore.
At least not without some proper verification of the source.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • becomeme@sh.itjust.works
  • DreamBathrooms
  • InstantRegret
  • osvaldo12
  • magazineikmin
  • mdbf
  • rosin
  • Youngstown
  • thenastyranch
  • slotface
  • cisconetworking
  • khanakhh
  • kavyap
  • ngwrru68w68
  • ethstaker
  • JUstTest
  • everett
  • modclub
  • cubers
  • Durango
  • anitta
  • tester
  • GTA5RPClips
  • tacticalgear
  • normalnudes
  • megavids
  • Leos
  • provamag3
  • lostlight
  • All magazines