The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI

The race to train language models on vast, diverse, and inconsistently documented datasets has raised pressing concerns about the legal and ethical risks for practitioners. To remedy these practices threatening data transparency and understanding, we convene a multi-disciplinary effort between legal and machine learning experts...

  • All
  • Subscribed
  • Moderated
  • Favorites
  • machinelearning
  • khanakhh
  • magazineikmin
  • osvaldo12
  • cubers
  • mdbf
  • Youngstown
  • tacticalgear
  • slotface
  • rosin
  • kavyap
  • ethstaker
  • everett
  • thenastyranch
  • DreamBathrooms
  • megavids
  • InstantRegret
  • normalnudes
  • GTA5RPClips
  • tester
  • ngwrru68w68
  • cisconetworking
  • modclub
  • Durango
  • Leos
  • provamag3
  • anitta
  • JUstTest
  • lostlight
  • All magazines