drahardja,
@drahardja@sfba.social avatar

I am once again encouraging to add a clause to prohibit using their server’s data for machine training into their Terms of Use, because at some point in the near future there is likely going to be a lawsuit against some major company for scraping and exploiting users’ data, and we should make sure we have a legal leg to stand on.

Please ask your server’s admin to do this.

@seb please consider doing this. https://mastodon.world/@Chimaera/110652906429977656

EricErack,
@EricErack@masto.bike avatar

@drahardja
Coucou @alter_unicorn et @cyrille qu'en pensez-vous ?

harkank,
@harkank@chaos.social avatar

@drahardja @seb
Here is the lawsuit running against M$ & OpenAI. 3 billion Dollars. More tomorrow inm German.

https://storage.courtlistener.com/recap/gov.uscourts.cand.414754/gov.uscourts.cand.414754.1.0.pdf

Gemma,

@drahardja @trendless Another idea ☺️

polychromata,

@drahardja Do you have drop-in text for this? I tried to just make something up and realized that it would also disallow federation, which is obviously not what I want.

drahardja,
@drahardja@sfba.social avatar

@polychromata I don’t. There are lawyers working on this so I’m hopeful we’ll have something.

roryh,

@leo sounds like something you might get behind.

ZeirosLion,

@drahardja
@seb

Just gonna slide this to @self.

SpeakerToManagers,
@SpeakerToManagers@wandering.shop avatar
danciruli,
@danciruli@hachyderm.io avatar

@drahardja @seb I'd love to hear some IP lawyers weigh in. Certainly it is technically possible to read everything in a feed without ever agreeing to any terms of service (just browse to https://sfba.social/@drahardja, or heck, subscribe to https://sfba.social/@drahardja.rss to get it programmatically).

So there's no technical way to get a company to agree before reading. Not sure if you could include copyright text in every post somehow...

billy,
pilhofer,

@seb @drahardja Sorry, isn’t this basically settled case law? Any lawyers handy to address this? I’m super curious.

https://www.eff.org/deeplinks/2022/04/scraping-public-websites-still-isnt-crime-court-appeals-declares

HistoPol,
@HistoPol@mastodon.social avatar
elmerot,
@elmerot@mastodon.nu avatar

@drahardja @seb
Good idea! Ping
@admin

meowryveilles,

@mods could be relevant?

poemproducer,
@poemproducer@systerserver.town avatar
amart,

@drahardja @seb It’s clear, large corporations don’t care about what’s legally permissible in whatever jurisdictions they’re operating. I don’t understand the activity, pub API, but it would seem more effective to cripple the ability to mass scrape by throttling access in an appropriate fashion that wouldn’t impact typical usage patterns of users. I trust big corporations to not scrape data if they can’t scrape data.

simon_lucy,
@simon_lucy@mastodon.social avatar

@drahardja @seb

If you can't enforce it it's not worth the effort.

What would be worth the effort is a treaty between as many governments as possible that enforced disclosing the sources of data for the purposes of machine learning and allowed individual and group rights for the exclusion of data whether before or after being used.

Along with that, a requirement for all models to provide an EXPLAIN function that delivered both the sources used and the inferences made.

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@drahardja @seb Hi Dave! This is something the admin team is already working on, along with some other policy updates.
I'll push this to the top of the queue.

We have a lawyer on the team (that's me), so no need to collect money for the updates; I do SFBA work on a pro bono basis.

drahardja,
@drahardja@sfba.social avatar

@neuralgraffiti @seb I’m super happy to hear that!

integerpoet,
@integerpoet@sfba.social avatar

@neuralgraffiti @drahardja Decades ago, I — obviously — decided not to pursue a law degree, but since then every time I hear about pro bono work I question that decision. 😀

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@integerpoet @drahardja It’s a mixed bag. I the work I do, but I miss being a developer sometimes.

integerpoet,
@integerpoet@sfba.social avatar

@neuralgraffiti @drahardja I don’t wish I were a lawyer in general, but it would be cool if there were the equivalent of pro bono work for a software engineer with my specialties (and NDAs).

drahardja,
@drahardja@sfba.social avatar

@integerpoet @neuralgraffiti It’s called a pseudonym (aka alt account)

pmonks,
@pmonks@sfba.social avatar

@neuralgraffiti @drahardja @seb Possibly dumb/RTFM question, but how is content that’s been posted to sfba.social licensed downstream? And if there isn’t a default, might I suggest CC-BY-NC-SA-4.0, with the ability for individual users to choose their own license if they wish?

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@pmonks @drahardja @seb Not a dumb question at all. We don't apply a particular license to the content right now, which means it's copyright to the poster (the legal default).

ActivityPub doesn't currently support applying varying licenses to content, although @timbray has proposed a protocol enhancement to support it: https://codeberg.org/fediverse/fep/src/branch/main/fep/c118/fep-c118.md

pmonks,
@pmonks@sfba.social avatar

@neuralgraffiti @drahardja @seb @timbray While a protocol enhancement for this is definitely valuable, is it legally necessary? Wouldn’t a notice on the server’s “about” page (or similar) legally serve the same purpose (albeit without any machine discoverability)?

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@pmonks @drahardja @seb @timbray Nope, definitely not required. I was more addressing the second part of your question re user’s choosing licenses.

pmonks,
@pmonks@sfba.social avatar

@neuralgraffiti @drahardja @seb @timbray Right, but isn’t a server default necessary anyway, for the (vast majority) of users who don’t bother explicitly licensing their own content?

And isn’t a protocol enhancement also nice but unnecessary for the per-user case? For example I explicitly license my content via a simple textual statement in my profile (though again, not easily machine discoverable, unlike a protocol enhancement).

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@pmonks @drahardja @seb @timbray Yes to the per-user case. You could put a license in your profile or something and license your posts however you want.

As for the need for a default, there already is a server-wide default as far as copyright licensing goes. Any post eligible for copyright in the first place is automatically protected as soon as it posted.* So applying, e.g., a Creative Commons license by default would essentially take some rights away from our users by making their posts available for non-commercial use by others or whatever other permissions the license grants. (1/2)

neuralgraffiti,
@neuralgraffiti@sfba.social avatar

@pmonks @drahardja @seb @timbray Circling back to the initial reason for this thread, where things get a more complicated is the interaction between contract law and copyright. In broad strokes, you can apply contract law to protect things that aren’t necessarily covered by copyright. The arguable reason to put something in our Terms of Service is that some pro-ML folks argue that using data for training a model is a fair use. If they are right, then it’s not a copyright violation to use the data, regardless if what license you’ve granted to the public. But it could be a violation of a provision in a ToS, giving you a breach of contract claim against someone improperly using the data.

This is putting aside some other complications, like implied licenses and such, but this is already a long post for my phone…

  • The legal standard is that it is protected when it is “fixed in a tangible medium.”
    (2/2)
yannickdoteu,
@yannickdoteu@mastodon.green avatar

@drahardja @seb

@JohanEmpa, is this something you consider?

bok_bok_ba_gok,

@drahardja @seb @trumpet please consider doing this! 😘

essjay,

@drahardja

@lily
What do you think?

@seb

vlrny,
@vlrny@disabled.social avatar

@drahardja

Has anybody tapped @eff about this? This seems like their jam, they should have some resources.

Also fyi admin, I care about this. Hope it's on your radar.

Beeks,
@Beeks@mstdn.party avatar

@drahardja @seb it won't do any good. Best case scenario they scrape it and use it, get sued and ten years later we all get to apply for 5 dollars as part of a class action settlement.

kiwiapple87,

@SDF If you haven't done this yet, please do. Thank you!

jamesholden,
@jamesholden@mas.to avatar
macumbamacaca,
  • All
  • Subscribed
  • Moderated
  • Favorites
  • fediverse
  • DreamBathrooms
  • InstantRegret
  • ethstaker
  • magazineikmin
  • GTA5RPClips
  • rosin
  • modclub
  • Youngstown
  • ngwrru68w68
  • slotface
  • osvaldo12
  • kavyap
  • mdbf
  • thenastyranch
  • JUstTest
  • everett
  • cubers
  • cisconetworking
  • normalnudes
  • Durango
  • anitta
  • khanakhh
  • tacticalgear
  • tester
  • provamag3
  • megavids
  • Leos
  • lostlight
  • All magazines