despens,
@despens@post.lurk.org avatar

Remove your source code on GitHub from "The Stack", that AI "training set" scooped up by the Software Heritage Archive https://huggingface.co/spaces/bigcode/in-the-stack

emenel,
@emenel@post.lurk.org avatar

@despens i found two of my old github repos in this dataset. Both were private, and deleted last year. What a serious breach of trust.

Sascha,
@Sascha@bonn.social avatar

@emenel @despens Same here: in the dataset is a personal repository longtime deleted, where already the title of the repository is personal and should stay private. And this is not fun at all. I think I will file a complaint at @bfdi and hope that this is the right way.

Cc @ulrichkelber which „land“ is responsible for such cases?

bfdi,
@bfdi@social.bund.de avatar

@Sascha @emenel @despens @ulrichkelber If there is a problem with GitHub, our best guess would be the Data Protection Authority (DPA) from the Netherlands (GitHub has stated that their Data Protection Officer for Europe is located there; https://www.bfdi.bund.de/SharedDocs/Adressen/DE/EuropaeischeDatenschutzbeauftragte/Niederlande.html). But as always you may also send your complaint to the DPA at state level where you live (https://www.bfdi.bund.de/SharedDocs/Adressen/DE/LfD/NordrheinWestfalen.html). Of course you may also send your complaint to us. / ÖA

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • ngwrru68w68
  • rosin
  • GTA5RPClips
  • osvaldo12
  • love
  • Youngstown
  • slotface
  • khanakhh
  • everett
  • kavyap
  • mdbf
  • DreamBathrooms
  • thenastyranch
  • magazineikmin
  • megavids
  • InstantRegret
  • normalnudes
  • tacticalgear
  • cubers
  • ethstaker
  • modclub
  • cisconetworking
  • Durango
  • anitta
  • Leos
  • tester
  • provamag3
  • JUstTest
  • All magazines