liampomfret (@liampomfret@mastodon.social)

The number of web crawlers right now that just blatantly ignore robots.txt is frustrating. It's pretty clear many of them are scraping every single website they can to train generative text AI, too.

If nothing else, there's surely got to be some clear violation of a whole lot of CC licenses in this behaviour.