vantablack, to random
@vantablack@cyberpunk.lol avatar

just discovered what miiiiiight be some sorta new fedi scraper indexer thingy?

definitely don't take my word for it but, just throwing this out there so someone more technically-inclined can take a look

https://fediscanner.info/

vantablack,
@vantablack@cyberpunk.lol avatar
vantablack,
@vantablack@cyberpunk.lol avatar

⚠️ FEDI SCRAPER AND INDEXER ⚠️

okay according to multiple peeps in the replies of the original post, this is indeed in fact a fedi scraper and indexer i found

https://fediscanner.info

Taffer, to llm
@Taffer@mastodon.gamedev.place avatar

I was going to ask if there’s some robots.txt magic that’ll keep LLM scrapers out.

Then I thought of a better idea.

Is there a source of text/images that I can toss on there that’ll poison “AI” scrapers?

benlk, to random
@benlk@newsie.social avatar

I have an idea for a 1000+record scraper project, but I'm not sure how to capture the data. CSV or a database? Which database? Any suggestions?

benlk,
@benlk@newsie.social avatar

If I end up putting this project online, then it's probably best to use mysql, since that's guaranteed to be available on website hosts, but is there a better recommendation?

@simon @palewire

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • thenastyranch
  • magazineikmin
  • mdbf
  • GTA5RPClips
  • everett
  • rosin
  • Youngstown
  • tacticalgear
  • slotface
  • ngwrru68w68
  • kavyap
  • DreamBathrooms
  • khanakhh
  • megavids
  • tester
  • ethstaker
  • cubers
  • osvaldo12
  • cisconetworking
  • Durango
  • InstantRegret
  • normalnudes
  • Leos
  • modclub
  • anitta
  • provamag3
  • lostlight
  • All magazines