cquest, to llm French
@cquest@amicale.net

This morning... two scraping bots feeding AI/#LLM models abused the @osm_fr forum.

This isn't the first time, and it's becoming a real plague, especially when #ClaudeBot requests URLs from our old #phpBB, which was replaced several years ago by #discourse.

Despite more than 130,000 404 errors this morning alone, it kept going at a frantic pace...

The other bot, albert-bot... from albertai.com (nothing to do with the French "cocorico" Albert), was blocked as well.

leah, to random German
@leah@chaos.social

After getting alerted tonight because #claudebot from Anthropic was scanning a host so aggressively that all 20 cores were saturated, I generated a list of the IPs they used (all or mostly AWS) so you can block them too.

https://gist.github.com/leahoswald/935f90ba09b3484d15ea6d20d0f2f99a

The bot is used to feed their AI model, so nobody really needs it, and after some research it also seems to ignore robots.txt. Bye bye 👋 🤷‍♀️ #MastoOps
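
For anyone who wants to check incoming addresses against a list like that, here is a minimal Python sketch using the standard `ipaddress` module. The ranges shown are placeholder documentation blocks (RFC 5737), not the contents of the gist, and the names `BLOCKED_RANGES`, `build_blocklist` and `is_blocked` are made up for illustration.

```python
import ipaddress

# Placeholder CIDR ranges (RFC 5737 documentation blocks).
# These are NOT the addresses from the gist; substitute your own list.
BLOCKED_RANGES = ["192.0.2.0/24", "198.51.100.0/24"]


def build_blocklist(cidrs):
    """Parse CIDR strings (or single IPs) into network objects."""
    return [ipaddress.ip_network(cidr, strict=False) for cidr in cidrs]


def is_blocked(ip, networks):
    """Return True if the address falls inside any blocked network."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in networks)


if __name__ == "__main__":
    networks = build_blocklist(BLOCKED_RANGES)
    for candidate in ("192.0.2.17", "203.0.113.5"):
        print(candidate, is_blocked(candidate, networks))
```

In practice the blocking itself would more likely live in the firewall or reverse proxy; a check like this is mainly handy for auditing access logs.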

pussreboots, to Claudeai
@pussreboots@sfba.social

If you run your own website, add

User-agent: ClaudeBot
Disallow: /

to your robots.txt

This bot, run by Claude.ai on Amazon's cloud service, is out of control. In 72 hours it ran up $2,000 worth of bandwidth overage charges. In 26 years of running my site I've never seen that much demand from anything.
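
To confirm that the two-line robots.txt rule quoted in this post parses the way you expect, here is a small Python sketch using the standard library's `urllib.robotparser`. The helper name `is_allowed` and the `example.org` URL are placeholders for illustration. It only checks how a compliant parser reads the file; it says nothing about whether a given crawler actually honors it.

```python
from urllib.robotparser import RobotFileParser

# The rule from the post, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: ClaudeBot
Disallow: /
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if a compliant parser would let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.org/any/page"  # placeholder URL
    print("ClaudeBot allowed:", is_allowed(ROBOTS_TXT, "ClaudeBot", url))
    print("SomeOtherBot allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", url))
```

With only this rule in the file, other user agents remain unrestricted, so existing search engine crawlers are not affected.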
